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A second wind for the president 


A lack of leadership has hampered progress against global warming. If Barack Obama earns a 
second term as US president, will he have the energy to tackle climate? 


was every bit as real for scientists as for voters. Early optimism 

gave way before often-nasty political realities, but Obama 
mostly made good on his promises by appointing top-flight research- 
ers to his administration, elevating science within federal agencies and 
making research and development the foundation of a broader vision 
of innovation. As he makes his pitch for another four years, there can 
be little doubt that he intends to continue supporting science. But 
the vision that he laid out four years ago remains a work in progress. 

Nowhere is this clearer than with energy and climate. As discussed 
on page 488, the president has increased investments in research and 
development that may one day help to reduce the cost of clean energy 
technologies and hasten progress on fighting global warming. Regula- 
tions that aim to clean up the atmosphere and protect public health 
are already encouraging a transition towards cleaner energy. Obama 
also deserves credit for staking out a centrist position on energy 
issues, leaving room for both renewables and the smart development 
of domestic fossil-fuel resources. But as the political discussion about 
climate has degenerated to a frightening degree during the past four 
years, he has offered too little leadership on these issues. 

The polarization has its roots in both the politics of the economic 
crisis and the dearth of quick, cheap solutions. Curbing emissions on 
the scale required to avert dangerous warming is a daunting challenge, 
but it is feasible and will be cost-effective in the long term. Shifting to 
cleaner energy would bring benefits not just to climate, but also to air 
quality, infrastructure and public health. But the cleanest sources of 
power cost more up front, and politicians are loath to increase energy 
prices when the economy is struggling. And although the number of 
green-energy jobs is increasing, shutting down old coal-fired power 
plants means job losses — and political protests — on a local scale. 
This has fuelled angry opposition to the modest pollution regulations 
that Obama’ administration has rolled out during his first term. 

Given the toxic political atmosphere surrounding the November elec- 
tions, it is perhaps understandable that the administration, Democrats 
and even some environmentalists are saying little about global warming. 
But by failing to speak out, they have often ceded the airwaves to deniers. 
Although polling shows that almost two-thirds of US citizens support 
some kind of action on global warming, law-makers in Washington DC 
are back to debating the validity of climate science. The United States 
needs leadership that is willing and able to uphold and act on the science. 

If Obama wins a second term, he will need to take on the politi- 
cal opposition and bring apparently disparate interests together. This 
means laying out a clear vision for the future, and making the imme- 
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to think about his legacy instead of about re-election. 

His opponent, Mitt Romney, has a record on global warming that is 
spotty at best. As governor of Massachusetts from 2003 to 2007, Romney 
initially supported, then backed out of, the Regional Greenhouse Gas 
Initiative, a cap-and-trade programme to reduce emissions from elec- 
tricity generation across New England. Last year, he broke with Repub- 
lican ranks and declared global warming a threat, but more recently he 

has questioned its severity. He is now propos- 


“The United ing to roll back the government's regulatory 
States needs powers. If he wins the election, scientists will 
leadership have to work with the new administration and 
that is willing hope that the president will muster the cour- 
and able to age to take on his own party and govern as the 
uphold and centrist he was in Massachusetts, where he 
act on climate introduced a state health-care initiative that 


was the model for Obamas national effort. 

The next round of greenhouse-gas policies 
is likely to consist of targeted measures, unlike the economy-wide cap- 
and-trade approach that failed in 2010. The Obama administration 
has regulated emissions from vehicles and required sharp increases in 
fuel economy; the next big target is electricity generation. Like the car 
industry, utility companies want economic certainty. Environmental - 
ists want emissions reductions. Is there a way to ensure both without 
over-burdening consumers? Or could a carbon tax make a comeback 
as a way to increase revenues and lower income taxes? 

Whatever the next steps, they will require bringing the business 
community together with scientists, environmentalists, workers and 
others. A clever leader might yet be able to build support for viable 
political solutions where seemingly narrow interests converge. m 


science.” 


Therapy deficit 


Studies to enhance psychological treatments 
are scandalously under-supported. 


diagnose. There is no doubt, however, that it represents 
a colossal health burden. According to the World Health 
Organization, it is one of the leading illnesses when ranked by impact 
on productivity, and directly affects 121 million people worldwide. 
As has been discussed in Nature’s pages, the standard research-based 
model of therapy development for psychiatric disorders — target iden- 
tification, drug discovery and clinical investigation — is imploding as 
biologists grapple with the complexities and drug companies see too 
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few prospects to keep investing. Most psychologists agree that the 
drugs already in use are efficacious to a point, but with plenty of room 
for improving their effectiveness and reducing side effects. 

Psychological treatments are the other arm of therapy for depression 
and many psychiatric disorders. The literature is full of trials indicating 
that they are in some contexts more effective and better at preventing 
relapse than pharmaceutical interventions. There are no physical side 
effects, but some of the best-validated treatments depend on therapists 
with training. Unlike drug therapies, there are no multibillion-dollar 
suppliers of these treatments with the motivation to market them and 
make them easily obtainable. 

The money being spent on researching psychological therapies 
is tiny by comparison with budgets for fundamental neuroscience 
and for translation and drug discovery. And yet such therapies have 
the potential to make a substantive difference to new categories of 
patients. What is more, treatments could doubtless be improved in 
their existing applications by deeper investigation of their outcomes. 

This fundamental funding imbalance is compounded by the fact that 
the sources of research funding disfavour studies of psychological treat- 
ments that focus only on efficacy. Nature would be the last to question 
the importance of research for causal or mechanistic insights. But the 
brain’s complexity and relative inaccessibility leaves a gulf between the 
current basic understanding and the observed pathologies of human 
minds. Taking existing psychological therapies and improving and 
extending their psychological models and applications, and developing 
new ones, regardless of the underlying neural processes in the brain, is 
also a justifiable endeavour for funding agencies. Given the immediate 
benefits this research could bring, the deficit of interest in funding it is 
ascandal. 


Take one prominent example. Since 2006, UK governments have 
increasingly committed to a programme called Improving Access 
to Psychological Therapies. This massive, enlightened initiative in 
psychotherapeutic capacity-building has increased the number 
of practising cognitive behavioural therapists by several thousand, 
providing more than 600,000 people with access to these services, 
particularly patients with depression and anxiety disorders. 

Inspect the databases of this programme and 


“Psy chological you find large variations in outcomes between 
therapies have UK regions. A deeper look would allow one to 
the potential examine many factors relating to the effective- 
tomakea ness of such treatments, thereby opening the 
substantive prospect of improvements in the United King- 
difference to dom and elsewhere. 


patients .” As psychologists outside Britain will confirm, 
despite the variability in outcomes, this pro- 
gramme represents a world-beating standard thanks to the scale of its 
implementation and the validation of its treatments by the UK National 
Institute for Health and Clinical Excellence. And yet since its inception, 
the UK government has prevented any of the funds from being used 
for scientific research on the factors underlying its measured impacts. 

This is an extreme example. Many funding agencies around the world 
are too keen solely to support mechanistic investigations with potential 
long-term payoffs, and too unwilling to appreciate that part of their 
portfolio should be oriented towards identifying immediately effective 
psychological interventions. Success in this area would further encour- 
age policy-makers to enhance much-needed access to treatment for psy- 
chiatrically ill individuals. After all, many of these people are taxpayers 
who ultimately fund research into brains and minds. m 


Poison postures 


Researchers working on controversial topics 
must take care how they promote their results. 


last week’s issue of the respectable French news weekly, 

Le Nouvel Observateur. The blatantly fallacious headline 
was the opening salvo in a blitz of media coverage about research 
published online in the journal Food and Chemical Toxicology. The 
paper, from a research group led by Gilles-Eric Séralini, a molecular 
biologist at the University of Caen in France, claimed to show that 
consumption of a genetically modified (GM) maize (corn) that had 
already been approved for animal and human consumption and of the 
herbicide Roundup greatly elevates rates of cancer and other illnesses, 
and causes premature deaths, in rats. 

With such strong claims and the predictably large effect they will have 
on public opinion, researchers should take care how they present their 
findings to the public and the media. They should spell out their results 
clearly; emphasize the limitations and caveats; and make it clear that the 
data still need to be assessed, and replicated, by the scientific community. 

That didn’t happen. The paper was promoted in a public-relations 
offensive, with a related book and film set for release this week. Further- 
more, journalists wishing to report the research had to sign confidenti- 
ality agreements that prevented them from contacting other scientists 
for comment on the paper until after the embargo had expired. Some, 
to their credit, refused, or accepted and then revisited the story critically 
once their hands were no longer tied by these outrageous restrictions. 

The result was the exclusion of critical comment in many of the 
breaking stories — the ones that most people will remember. But much 
criticism has followed (see page 484). In hindsight, journalists who 
agreed to the conditions should have contacted the journal publisher 


cc , es, GMOs are poisons,” screamed the front cover of 
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— in this case, Elsevier — which no doubt would have done its best to 
have made the paper available under embargo without such constraints. 

The embargo system gives journalists advance access to papers 
on the premise that it will give them time to research the story fully 
before presenting it to the public. The system also benefits journals and 
authors because it helps to maximize press coverage, but at its heart, it 
is about helping to improve the quality of science reporting. Crucially, 
the embargo system allows journalists to consult scientists not involved 
in the work before interpreting it for the public. 

The criticisms that followed questioned both the study’s methodol- 
ogy and findings. Given its exceptional claims, the authors of the paper 
now need to make their raw data available so that they can be carefully 
assessed by scientists with appropriate expertise, and the work replicated 
or refuted — thus will the study stand or fall. The European Food Safety 
Agency, and other advisory and regulatory bodies, are planning to assess 
the research, and these assessments should be as transparent as possible, 
so that the public can be confident about whatever they conclude. 

The events of last week also illustrate a long-standing flaw in the 
debate over the safety of GM crops. Many have used last week's publica- 
tion to claim that GM foods are a health risk. But even ifone GM crop 
were to be shown to have serious adverse health effects, that would say 
little about others: the safety of any genetic modification depends on the 
crop and on the particular changes introduced. Scientists who support 
transgenic crops fall into the same trap when they claim that the many 
GM crops that have passed safety tests somehow show that ‘GMOs are 
safe. They should instead be giving the message that GM foods must 
be assessed on a case-by-case basis. 

Too often in the GM-food debate, generalizations and extremism 
lead to sterile public and political discourse that obscures key issues: 
what sorts of GM crops might bring true benefits to agriculture and 
consumers; how to avoid monopolization of 
farming choices; and what types of sustain- 
able agriculture we want in the future. Polar- 
ized debates, not GMOs, are the poison to be 
avoided. m 
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hen it comes to apportioning credit, science could learn 
Wim the movies. Since 1934, the Academy of Motion 

Picture Arts and Sciences, which awards the Oscars, has 
maintained an index to film credits, now called the Motion Picture 
Credits Database. For every film being considered for an award, the 
contribution of each person who worked on it, from hair stylists to the 
lead actress, is documented. Science, for all its focus on assessing and 
quantifying, has nothing like this. 

The main currency in the world of science is authorship. Author- 
ships enable scientists to accumulate citations, which seem to be 
established as the ‘true’ measure of successful and important science. 
Authorships are key to getting grants and winning promotions, and are 
also the foundation for a recently proposed algorithm for predicting 
scientists success — their future h-index (D. E. Acuna et al. Nature 
489, 201-202; 2012). 

Yet authorship is a dubious indicator. Sophis- 
ticated readers know that decoding an author 
list in terms of actual contribution to the scien- 
tific paper is close to impossible, unless you are 
involved yourself. Students find that they have to 
contribute much more to be listed as an author 
than more senior researchers do. Initially, stu- 
dents can feel exploited, but they may get used 
to the system. Eventually, if they choose an aca- 
demic career, they may (more or less willingly) 
adopt the behaviour themselves. Political and 
personal relations also determine whether a con- 
tributor appears in the list of authors or under 
‘Acknowledgements. 

Some disciplines follow conventions in which 
list order reflects the relative importance or type 
of contribution. However, only experienced readers understand the 
conventions, which differ between disciplines and can be difficult to 
apply to papers with many authors. 

For example, a six-page paper on the discovery of the Higgs boson at 
the Large Hadron Collider at CERN, Europe’ particle-physics labora- 
tory near Geneva, Switzerland, was followed by seven pages of author 
names, but readers without a deep understanding of the experiment’s 
organization have no way of deducing the roles of the authors. They 
could not all have written the manuscript, and skilled people not listed 
probably also contributed to the experiments. Simplifying each person's 
contribution to a co-authorship results in an incomplete picture of their 
role in the work. 

Scientific publishers are aware of the problems of authorship, and 
many journals (such as Nature) publish descrip- 


tions — often in broad terms — ofthe contribu- NATURE.COM 
tions of each author. However, these accounts are _ Discuss this article 
mainly intended to keep author lists free of people __ online at: 

who have made no real contribution to the work. —_go.niature.com/qtpehz 
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Online databases could increase fairness and transparency by fully 
documenting the role of each contributor to a paper, says Sebastian Frische. 


They provide little detail on what each author did. And they leave out 
people whose activities — fund-raising or data acquisition, for example 
— are not enough to qualify for authorship, but are still pivotal. 

A better system could soon emerge. Already, databases of scientific 
authors such as ResearcherID.com, BioMedExperts.com and Research- 
Gate.net allow authors to interact, form interest groups and rate each 
others’ publications. ResearchGate has even introduced a new metric 
based on a scientist’s contributions and activity in the network. 

Asa natural next step, these online networks could give the authors 
of a paper an opportunity to publish detailed accounts of each per- 
son's contribution. With a mouse click, each contributor could verify 
what others contributed, resulting in merit lists validated by all par- 
ticipants. A similar, but paper-based, system has existed for decades 
in Denmark, where co-author declarations document what work PhD 
students contributed to the papers included in 
their theses. The rigorous rules for authorship, 
requiring substantial contributions to the design 
of the work, data gathering and writing, would 
no longer be relevant, because each contribu- 
tion could be described and valued. Everyone 
who made the science possible would be prop- 
erly acknowledged, and the word ‘author’ could 
regain its true meaning. 

In the highly competitive film industry, credit 
lists are the basis for the CVs of the diverse pro- 
fessionals needed to produce a blockbuster. A full 
contribution database would offer similar benefits 
to scientists. Contributors could generate updated, 
verified and detailed accounts of their scientific 
merits using a few clicks. Journals, funding agen- 
cies and employers would all come to rely on the 
system. Journals might replace the authors and acknowledgments sec- 
tions in papers with online credit lists, funding agencies could use the 
system to generate validated CVs for the review process and employers 
could find the person with just the right skills and experience for the job. 

The technical platform for such a database could emerge from the 
world of music, software development, film, journalism or any other 
creative industry. With the right technology, writing and approving 
entries in these networks could become everyday routine, just as 
‘liking’ something on Facebook is today. Validated web-based infor- 
mation networks will allow documented skills — not just numbers of 
papers or citation-based indices — to count towards advancement in 
academic life. The transparency and distributed nature of such net- 
works would create a strong incentive for honesty, and fundamentally 
improve the morale and culture of science. m SEE CAREERS P.591 


Sebastian Frische is associate professor of biomedicine at the 
University of Aarhus, Denmark. 
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Cells turn back 
clock in diabetes 


In type 2 diabetes, the 
pancreas’ B cells stop making 
insulin because they revert to 
progenitor cells, rather than 
because they die, as has been 
thought. 

A team led by Domenico 
Accili at Columbia University 
in New York studied mice 
lacking the Foxo1 gene, which 
is involved in cell specialization 
or differentiation, in 
their cells. The animals 
produced fewer f cells and 
developed high blood- 
sugar levels. Moreover, the 
authors found that the 6 cells 
reverted back to endocrine 
progenitor cells, which are 
unable to make insulin. 

Mice in two other models of 
diabetes also produced such 
de-differentiated B cells. 

Turning these stem cells 
back into 6 cells could be a way 
to treat type 2 diabetes, the 
authors suggest. 

Cell 150, 1223-1234 (2012) 


Fragile-X drug in 
humans and mice 


A drug called arbaclofen has 
shown potential for relieving 
symptoms of fragile X 
syndrome — a genetic disease 
that causes intellectual 
disability and autism — 
according to a clinical and an 
animal study published back- 
to-back. 

Fragile X syndrome is 
caused by a faulty gene, 
which leads to excessive 
protein synthesis at neuronal 
connections, or synapses, in 
the brain, and to exaggerated 
excitatory neurotransmission. 
Two teams led by scientists 
from Seaside Therapeutics in 
Cambridge, Massachusetts, 
investigated whether 


Selections from the 
scientific literature 


ECOLOGY 


Artificial marshes fall short 


Kingdom. The researchers found that the 
natural habitats were home to many plants, 
including sea lavender (Limonium vulgare, 
pictured left), thrift (Armenia maritima) and 
sea plantain (Plantago maritima), that were 
often absent in replica habitats (pictured 


Man-made salt marshes have less biodiversity 
than similar naturally occurring habitats, and so 
are failing to meet European regulations. 
Natural salt marshes lost to coastal 
development or erosion must be replaced 
with biologically equivalent replicas under 
European Union law. Hannah Mossman at the 
University of East Anglia in Norwich, UK, and 
her team compared the vegetation at 35 salt 
marshes that had been man-made or created 
by storm surges with that at 34 naturally 
occurring marshes across the United 


arbaclofen — which promotes 
inhibitory neurotransmission 
by activating GABA, receptors 
— could mitigate symptoms. 
Paul Wang and his 
colleagues ran a placebo- 
controlled clinical trial in 
63 people with fragile X. They 
found no effect on irritability, 
but social function improved 
in 27 people who had severe 
social impairment. 
Ina study of mice with 
the condition, Aileen Healy 
and her colleagues found 
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right). 


that the drug corrected the 
excessive protein synthesis 
and associated synaptic 
abnormalities. 

Sci. Transl. Med. 4,152ra127; 
152ra128 (2012) 


Pulsating tubes 
act as pumps 


Nanometre-scale tubules made 
of self-assembled molecules 
have been designed to expand 


The authors suggest that management of 
artificial marshes should be improved using 
measures such as planting extra species. 

J. Appl. Ecol. http://dx.doi.org/10.1111/j.1365- 
2664.2012.02198.x (2012) 


and contract with changing 
temperature, pumping 
molecules out of their interior. 
Nanotubes are normally 
too rigid to allow major 
structural changes, thanks 
to the bonds that hold their 
shape. Myongsoo Lee at 
Seoul National University 
and his colleagues created 
pulsating nanotubules from 
bent carbon-based chains 
that stack together in water. 
The ‘arms’ of the chains 
overlap to form staggered 
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hexagonal rings that then 
assemble into a tube. When 
the temperature rises or falls, 
the segments slide against each 
other and the tubule expands 
or contracts, undergoing 
extensive structural changes 
but maintaining its hollow 
arrangement. 

Science 337, 1521-1526 (2012) 


Symbiosis may 
fertilize seas 


An unusual partnership 
between single-celled algae 
and cyanobacteria may 
provide a key source of 
nitrogen for marine life, and 
could shed light on how plants 
evolved to have photosynthetic 
organelles. 

Jonathan Zehr at the 
University of California, 
Santa Cruz, and his colleagues 
investigated a widespread 
cyanobacterium that lacks 
certain genes involved 
in photosynthesis and 
metabolism but can convert 
atmospheric nitrogen into 
forms that other organisms 
can use. Zehr and his team 
sampled sea water (pictured) 
to look for symbiotic partners 
of the bacterium, and founda 
kind of unicellular alga. The 
bacterium attaches to the cell 
wall of the alga, and provides it 
with nitrogen in exchange for 
carbon. 

The symbiosis is probably 
crucial to global carbon 
and nitrogen cycles, and 
is analogous to the early 
stages of an evolutionary 
process in which plants 
acquired microbes that 
would eventually become 
chloroplasts, the organelles 
that perform photosynthesis. 
Science 337, 1546-1550 (2012) 


CLIMATE CHANGE 


Arctic snow lost 
faster than ice 


The rate of loss of Arctic 
snow cover during spring 
is outpacing even the rapid 
decline of sea ice. 

Chris Derksen in Toronto 
and Ross Brown in Montreal, 
both of Environment Canada, 
looked at satellite data on 
high-latitude snow cover 
between April and June since 
1979, and compared it with 
observed changes in the extent 
of Arctic sea ice in the same 
period. In most Arctic land 
areas, snow cover remaining in 
June had decreased by almost 
20% per decade. Arctic sea ice, 
which this summer melted to 
anew record low, has declined 
by about 10% per decade since 
1979. 

State-of-the-art climate 
models have considerably 
underestimated the 
accelerating downward trend 
in both snow cover and sea-ice 
extent since 2005, the authors 
note. 

Geophys. Res. Lett. http://dx.doi. 
org/10.1029/2012GL053387 
(2012) 


Salamanders heal 
like embryos grow 


The Mexican axolotl 
salamander, known for its 
ability to regrow lost limbs, 
may heal by reprogramming 
cells in the stumps of the 
amputated limbs to mimic 
developing embryonic cells. 
Tony Hunter at the Salk 
Institute for Biological 
Studies in La Jolla, California, 
and his colleagues found 
that limb amputation in 
the axolotl (Ambystoma 
mexicanum, pictured) 
triggered expression of two 
genes, PL1 and PL2 — which 
are active in sperm and 
egg cells and in developing 
embryos — in the new 
outgrowth. The same 
genes were not stimulated 
near superficial wounds. 
Suppressing production of 
the PL1 and PL2 proteins 
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A peek at organelles in live cells 


> HIGHLY READ 
on WWW.pnas.org 
in August 


Small molecules can be used for high- 
resolution fluorescence imaging of 
membrane-bound organelles in living cells. 


A team led by Guo-Qiang Bi at the 
University of Science and Technology of China in Hefei 
and Xiaowei Zhuang at Harvard University in Cambridge, 
Massachusetts, developed a fluorescence-imaging method 
that uses common membrane probes. The researchers 
showed that the fluorescence of eight such probes can be 
switched on and off in living cells in normal culture medium, 
enabling high-resolution imaging. Using the probes, the 
researchers were able to obtain images of key organelles, 
including the power-generating mitochondria, with a spatial 
resolution of 30-60 nanometres and a temporal resolution of 
1-2 seconds. This allowed them to capture previously unseen 
details about the dynamic membranes of the organelles, such 
as structures involved in the fusing of mitochondria. 
Proc. Natl Acad. Sci. USA 109, 13978-13983 (2012) 


increased cell 

death in the developing 
limb and slowed its growth. 
Lower levels of the proteins 
were also linked to decreased 
levels of FGF8, a molecule 
important in normal limb 
development. 

The findings could inform 
strategies for regenerating 
tissue in humans, the 
authors say. 

Dev. Biol. 370, 42-51 (2012) 


BIOCHEMISTRY 


Enzyme design 
inspired by cancer 


Mutations found in human 
cancer cells have been used to 
make enzymes that catalyse a 
reaction important in nylon 
production. 


Hai Yan and his colleagues 
at Duke University in 
Durham, North Carolina, 
identified stretches of amino 
acids from oxidoreductase 
enzymes with cancer- 
associated mutations. These 
mutations change the 

chemical reaction that the 

enzymes carry out on their 

substrates, molecules with 
a backbone of five carbon 
atoms. The researchers 
introduced these amino-acid 
residues into the active sites 
of related enzymes that act 
on substrates containing six 
carbon atoms. The resulting 
enzyme performed the desired 
cancer-associated reaction 
on six-carbon molecules, and 
could catalyse a reaction in 
the synthesis of adipic acid, a 
precursor of nylon. 

The authors suggest that 
data from sequenced cancer 
genomes could be used to 
increase functional diversity 
in enzymes. 

Nature Chem. Biol. http://dx.doi. 
org/10.1038/nchembio.1065 
(2012) 
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| BUSINESS 
Database case 


The American Chemical 
Society (ACS), the 

world’s largest scientific 
society, is likely to have 

to pay US$11.5 million in 
compensation to a small 
start-up company after a 
ruling by the Ohio Supreme 
Court on 18 September. The 
court ruled that the ACS, 
based in Washington DC, filed 
a lawsuit to unfairly suppress 
chemical-informatics firm 
Leadscope of Columbus, Ohio. 
See page 482 for more. 


Drug-makers unite 


Ten giant pharmaceutical 
companies have formed a 
non-profit organization called 
TransCelerate BioPharma, 
with the goal of making 
clinical trials more efficient. 
By setting universal standards 
in trial design and data 
collection, the group could 
shave time and expense off the 
drug-development process, 
says Garry Neil, head of 
TransCelerate and a former 
vice-president of science and 
technology at Johnson & 
Johnson. Smaller companies 
are invited to join the team. See 
go.nature.com/jj2sky for more. 


Stem-cell funds 


The European Parliament's 
legal committee has 
recommended that research 
involving human embryonic 
stem cells should not be 
funded in the European 
Union's upcoming Horizon 
2020 research programme. 
The committee refers to a 
decision by the European 
Court of Justice, which 
ruled on ethical grounds 
last October that treatments 
based on such cells were 

not patentable (see Nature 
480, 310-312; 2011). 
Horizon 2020 is intended to 


First light for dark-energy lens 


A camera designed to hunt for signs of dark 
energy, a force thought to be responsible for 
the Universe’s rapid expansion, snapped its 
first images on 12 September. The images 
from the Dark Energy Survey camera (Nature 
489, 190-191; 2012), which is mounted on 


promote Europe’s economic 
competitiveness, and the 
parliamentary committee 
argues that research that 
cannot be patented should 
not be supported under the 
programme. 


Ethics hub 

A US$1.5-million online 
resource for scientists teaching 
responsible conduct of 
research (RCR) was officially 
launched on 20 September. 
The Ethics CORE website 
(www.nationalethicscenter. 
org) is funded by the US 
National Science Foundation 
(NSF) and directed by 

C. K. Gunsalus, an expert 

on research integrity at 

the University of Ilinois at 
Urbana-Champaign. The 
America COMPETES Act 

of 2007 made RCR training 
mandatory for all NSF grant 


478 | NATURE | VOL 489 | 27 SEPTEMBER 2012 
© 2012 Macmillan Publishers Limited. All rights reserved 


recipients; the foundation 
implemented the requirement 
in 2010. See go.nature.com/ 
m38bnp for more. 


Rainforest threat 


Deforestation is on the 

rise again in the Brazilian 
Amazon. The amount of 
clear-cut land hit 522 square 
kilometres in August 2012, 
up from 163 km/ in the same 
month last year, according to 
satellite measurements made 
by Brazil’s National Institute 
for Space Research (INPE), 


based in Sao José dos Campos. 


Deforestation had been on 
the wane since April 2011. 
See go.nature.com/b7ej2p 
for more. 


Immigration bill 
The US House of 
Representatives rejected a bill 
on 20 September that would 


the 4-metre Blanco telescope in Chile, were 
generated as part of a testing phase that is set to 
end in December. This stitched-together image 
of sections of sky shows the globular star cluster 
47 Tucanae, which is around 5,000 parsecs from 
Earth. See go.nature.com/ya5y2p for more. 


have granted permanent 
residency — commonly 
referred to as ‘green card’ status 
— to 55,000 foreign scientists 
and engineers with advanced 
degrees from US universities 
each year. Both presidential 
candidates have expressed 
support for admitting more 
workers in these disciplines, 
but the bill failed to garner the 
two-thirds majority needed to 
pass. See go.nature.com/hatvjq 
for more. 


Arsenic in rice 


Two US reports have 
reignited worries about 
arsenic poisoning from rice 
and prompted politicians 

to introduce legislation on 

21 September that would 
limit the toxic substance. On 
19 September, the consumer 
protection group Consumers 
Union, based in Yonkers, New 


DARK ENERGY SURVEY COLLABORATION/FERMILAB 
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York, and the US Food and 
Drug Administration released 
reports that found inorganic 
arsenic in rice products. The 
presence of the substance is 
probably the result of the use 
of arsenic-based pesticides in 
cotton fields that were later 
used for rice farming. See 
go.nature.com/puezou 

for more. 


PEOPLE 


Development boost 
Chinese President Hu 

Jintao announced a US$1.5- 
million donation to TWAS, 
the academy of sciences 

of the developing world, 

on the opening day of the 
organization's 23rd general 
meeting in Tianjin, China, 
on 18 September. The 
donation means that China 
overtakes Brazil as the 
largest contributor to the 
organization, which is based 
in Trieste, Italy. Chemist Bai 
Chunli, current president 

of the Chinese Academy of 
Sciences, was also elected as 
the organization's president. 


AZT-maker dies 


Jerome Horwitz, the chemist 
who inadvertently created the 
first antiretroviral drug for 
AIDS, azidothymidine (AZT), 
died on 6 September, aged 93. 
News of his death emerged 
last week. Horwitz (pictured) 
created AZT in the 1960s to 
combat cancer, but he shelved 


TREND WATCH 


Arctic sea-ice coverage reached 


the lowest point of this year’s 


summer melt on 16 September, 


the US National Snow and 
Ice Data Center in Boulder, 


Colorado, has declared. On that 
date, around 3.41 million square 
kilometres of sea in the Arctic 
were at least 15% covered in ice, 
under the measurement system 
used by the data centre. The 
new low is by far the smallest 
figure seen in 33 years of satellite 
measurements — and possibly 


the lowest late-summer ice extent 


in more than 5,000 years. 


the drug after it showed little 
efficacy against the disease. 

A company that later became 
part of GlaxoSmithKline 
patented AZT in the 1980s 
when it was found to be 
effective against AIDS. 
Horwitz received no money 
for his discovery because he no 
longer owned the compound, 
but he did go on to produce 
treatments for diseases such 
as cancer at the Wayne State 
University School of Medicine 
in Detroit, Michigan, until he 
retired in 2005. 


Fraud inquiry 

Cardiff University is 
launching an investigation 
into allegations of scientific 
misconduct in the laboratory 
of its dean of medicine, Paul 
Morgan, a spokeswoman for 
the UK institution confirmed 
on 17 September. The 
allegations — that images have 
been manipulated in at least 
six of the laboratory's research 
papers — appeared on the 
website science-fraud.org 


SHRINKING ARCTIC SEA ICE 


in July, and the university 
launched an initial screening 
panel on 6 August. The panel 
recommended the formal 
investigation. 


Chimp research cut 
The US National Institutes 

of Health (NIH) is ending 

its funding for chimpanzee 
work at the largest centre for 
such research that it supports. 
The agency will retire 

110 chimpanzees from the 
New Iberia Research Center, 
part of the University of 
Louisiana at Lafayette, it said 
on 21 September. There are 
still 308 chimpanzees available 
for invasive experiments at 
two other NIH-supported 
centres; the agency says that 
these animals are sufficient 
for dwindling research needs. 
See go.nature.com/8mkgnf for 
more. 


Cancer ambitions 

A leading US cancer centre 
said last week that it will 

spend up to US$3 billion 

over the next ten years ona 
programme to significantly 
increase the odds of surviving 
eight cancers. Under the Moon 
Shots initiative, research teams 
at the MD Anderson Cancer 
Center in Houston, Texas, will 
focus on cancer of the prostate, 
lung and ovary, and on a type 
of breast cancer as well as 
melanoma, two leukaemias 


Sea-ice coverage in the Arctic bottomed out this week 
at the lowest extent ever measured by satellites. 
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and a related blood syndrome. 
See go.nature.com/lrevbl for 
more. 


XMRV ruled out 
The retrovirus XMRV is not 
linked to chronic fatigue 
syndrome (CFS), a definitive 
study that cost US$2.3 million 
concluded on 18 September. 
A 2009 paper found signs of 
XMRYV infection in people 
with CFS (V. C. Lombardi 

et al. Science 326, 585-589; 
2009), but was retracted in 
2011 because of concerns 
that the results were caused 
by contamination. The 

latest study (H. J. Alter et al. 
mBIO 3, e00266-12; 2012) 

is co-authored by members 
of two teams that reported 
associations between XMRV 
and related viruses and CFS. 
See go.nature.com/vadrb7 
for more. 


EVENTS 


Arctic drilling stops 


Plans to drill for oil and gas 
resources off the coast of 
Alaska have been abandoned 
following damage to oil 
containers on the spill-clean-up 
barge Arctic Challenger, oil 
company Shell announced on 
17 September. The setback 
means that the mission 

no longer meets the safety 
requirements for a permit to 
drill specific wells. Shell will 
continue to bore exploratory 
‘top holes’ in the Chukchi Sea in 
preparation for further drilling, 
which has now been delayed 
until 2013. See go.nature.com/ 
zynxtq for more. 


CORRECTIONS 

The story ‘Yellow river turns 
red’ (Nature 489, 182; 
2012) should have termed 
the Yangtze the golden 
river. And the story ‘Murder 
plea’ (Nature 489, 341; 
2012) wrongly located the 
Madison County circuit 
court. It should have been in 
Huntsville, Alabama. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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Olga Zelenina, seen in the dock at a hearing, is being held in prison in Moscow on a charge of abetting drug trafficking. 


Researchers campaign to 
free jailed Russian chemist 


Narcotics expert Olga Zelenina recognized as a political prisoner. 


BY QUIRIN SCHIERMEIER 


ussian scientists have reacted with 
Ra to the jailing of a colleague, 

narcotics expert Olga Nikolaevna 
Zelenina, on charges of aiding drug traffick- 
ing. Researchers believe that Zelenina has been 
falsely accused, and fear that the free pursuit 
of science in Russia is in jeopardy. “With state 
repression being a recurring issue in this coun- 
try, the outcome of this case is very important 
for us all; says Mikhail Gelfand, a biologist at 
the Institute for Information Transmission 
Problems in Moscow. 


As Nature reported last week (Nature 
http://doi.org/jd5; 2012), Zelenina, an ana- 
lytical chemist at the Penza Agricultural Insti- 
tute in Russia, was arrested at her home on 
15 August by officials from the Russian Fed- 
eral Drug Control Service (FDCS), who were 
accompanied by masked and armed members 
ofa special police unit. Five days later, a judge 
at Moscow’s Zyuzino District Court ruled that 
Zelenina would be detained until 15 October, 
pending a trial. 

Zelenina seems to have been targeted 
because ofa scientific report that she prepared 
in September 2011 for the defence attorneys of 


Sergey Shilov, a Russian businessman under 
investigation by the FDCS. As an expert on 
narcotic compounds in hemp and poppy, 
Zelenina was asked to assess the concentra- 
tion of opiates in a 42-tonne shipment of food 
poppy seeds that Shilov had imported from 
Spain in 2010. Russia banned cultivation of 
the opium poppy (Papaver somniferum) in 
1987, but the seeds can still be imported for 
use in foods including bread or cake, as long as 
they are free of alkaloids such as morphine and 
codeine. The seeds themselves do not contain 
these compounds, but other parts of the plant 
do, and can contaminate shipments. } 
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> Inher analysis, Zelenina found that the 
concentration of opium alkaloids in Shilov’s 
seed shipment was negligible, and that there 
was no scientific evidence that the seeds had 
been imported for the purpose of extracting 
illegal drugs. Her report also stated that it is 
technically impossible to completely eliminate 
these alkaloids from batches of poppy seeds. 

Her findings apparently did not please the 
FDCS. Although a Moscow court ruling on 
24 September raised hopes that Zelenina’s 
case could be heard sooner than October, she 
remained in custody as Nature went to press. 
For now, Zelenina shares a cell with activist 
Nadezhda Tolokonnikova, one of three mem- 
bers of the Russian punk band Pussy Riot who 
were last month sentenced to two years in jail 
on charges of hooliganism. Their crime was 
singing a song critical of Russian president 
Vladimir Putin in Moscow’s Cathedral of 
Christ the Saviour. 

Zelenina’s “is the most outrageous case 
I’ve heard of so far’, says Ivan Gololoboy, a 


sociologist at the University of Warwick in 
Coventry, UK, who has read the Russian- 
language report that was used to make the 
case against the chemist. Gololobov notes that 
Russia often restricts speech and art that are 
critical of the government. “But the case of 
Zelenina, an expert who expressed independ- 
ent opinions and arrived at scientific conclu- 
sions different from those of the prosecutors, 
is a step further,’ he says. “She stands accused 
not for oppositional activity or for her political 
opinions, but simply because of her independ- 
ent professional expertise. If proven guilty, she 
is facing a severe sentence.’ A conviction could 
carry a sentence of several years in prison. 

Natalia Andreeva, Zeleninas Moscow-based 
lawyer, says that the case is reminiscent of that 
of botanist and geneticist Nikolai Vavilov. 
He was jailed in 1940 and died in prison 
three years later, having challenged the non- 
Mendelian doctrines of heredity propagated 
at the time by Trofim Lysenko and backed by 
Joseph Stalin. 


Last month, the Russian government's 
human-rights commissioner, Vladimir Lukin, 
appealed against Zelenina’s detention without 
success. Zelenina is now recognized as a politi- 
cal prisoner by the Union of Solidarity, a Rus- 
sian human-rights watchdog. And last week, 
Russian scientists launched an online petition 
campaigning for her immediate release (go. 
nature.com/ihxmxu). 

The case reveals the inferior status of profes- 
sional science in modern Russia, says Mikhail 
Feigel’man, a physicist at the Landau Institute 
for Theoretical Physics in Moscow. “There is 
a lot of talk going on here these days that the 
absence of a functioning system of techno- 
logical innovation and professional scientific 
expertise is one of the big problems in this 
country,’ he says. “These qualities have indeed 
all but vanished — and Olga’s case makes me 
fear that they will not come back any time 
soon. Scientists in Russia will understand that 
what they are doing is more likely to get them 
in trouble than do anything good for society.’ m 


INTELLECTUAL PROPERTY 


Chemical society tried to 
block business competitor 


Court ruling raises concerns among some American Chemical Society members. 


BY EUGENIE SAMUEL REICH 


r | The world’s largest scientific society 
tried to unfairly obstruct a tiny entre- 
preneurial start-up from competing 

with its most lucrative scientific information 

product. That is the main conclusion of a 

ten-year legal battle that leaves the American 

Chemical Society (ACS) facing millions 

of dollars in penalties and sharp questions 

from some of its members about whether its 
business practices conflict with its mission to 
advance the chemical sciences. 

The 18 September ruling by the Ohio 
Supreme Court focuses on the society's cash 
cow: the Chemical Abstracts Service (CAS), 
which offers access to a database of 68 million 
chemical compounds, indispensable to many 
chemists the world over. By typing in the name 
of a chemical or submitting a drawing of its 
structure, researchers can access reams of data 
about the molecule’s properties, or a list of aca- 
demic papers referring to it. According to court 
documents in the case, CAS makes up about 
60% of the society’s gross revenue, amounting to 
some US$300 million a year. By contrast, mem- 
bership payments raise only about $12 million 


US$500m 


(see ‘Chemical caslv). The society, a non-profit 
organization based in Washington DC, ploughs 
the cash back into programmes for chemistry 
education, academic meetings and improve- 
ments in chemical-information resources. 

In 2002, the ACS sued Leadscope, a 
chemical-informatics firm in Columbus, 
Ohio. The society alleged that the company 
had infringed ACS intellectual property when 
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CHEMICAL CASH 


The American Chemical Society earns more than 
half of its income from the Chemical Abstracts 
Service (CAS). That sum overshadows recent court 
damages awarded against the society. 


@ Electronic services $404 million 

@ of which CAS $300 million 

| Petroleum research fund $21 million 
Membership fees $12 million 

™ Member insurance $11 million 

@ Conferences $10 million 

@ Advertising $9 million 

@ Investments $10 million 

@ Other $23 million 


Damages $11.5 million 
Not including legal fees and interest 


the firm obtained a patent on software to 
display correlations between structural 
features of chemical compounds and their 
biological and physical properties. 

The ACS argued that Leadscope’s prod- 
uct was based on an aborted project called 
PathFinder, which two of three Leadscope 
co-founders had worked on in the 1990s 
when they were employed by CAS, also based 


SOURCE: ACS 
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in Columbus. A trial court and an 
appeals court in Ohio both ruled 
against the ACS, but the society 
requested that the case be reviewed 
by the Ohio Supreme Court. That 
court has now refuted the society’s 
allegations, and has instead upheld 
counterclaims by Leadscope that 
the ACS filed an “objectively base- 
less” lawsuit to unfairly quash a 
competitor. 

The ruling means that the ACS 
will probably have to pay Leadscope 
$11.5 million in compensation — the 
amount awarded by the trial court in 
2008 and upheld by the appeals court 
in 2010. However, the ruling clears 
the ACS of defaming Leadscope, a 
charge that lower courts had upheld, 
letting the society off the hook for 
a further $15 million in damages. 
The Supreme Court found that, although the 
ACS did make false public statements about 
Leadscope infringing its intellectual property, 
those statements were an accurate summary of 
allegations already mentioned in court docu- 
ments and which the public had a legitimate 
interest in knowing about. 

Both the ACS and a lawyer for Leadscope 
declined to give interviews because the 
Supreme Court ruling must still be imple- 
mented by the original trial court in Frank- 
lin County, Ohio. The society has previously 
said that part or all of the court’s award may be 
covered by its legal insurance. In a statement 
responding to the Supreme Court's verdict, the 
ACS said that the ruling “will not impact ACS 
member dues; ACS products, programs or ser- 
vices; ACS staffing levels; or the ability of ACS 
to achieve its mission”. 

But some of the society’s 164,000 or so 
members, who include chemists from academia, 
industry and government, are not convinced. 
“T believe that this lawsuit has brought damage 
to the society, not only financially, but also repu- 
tationally,’ says Steven Bachrach, a chemist at 
Trinity University in San Antonio, Texas, an 
ACS member who has followed the case. “?'m 
concerned about the standing of the society in 
pursuing this aggressively after losing twice.” 

Richard Apodaca, an ACS member who is 
founder of Metamolecular, a chemical-soft- 
ware company in La Jolla, California, says that 
as a software entrepreneur, he sympathizes 


> 
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A virtual analogue to the molecule libraries in industrial labs (pictured), 
the Chemical Abstracts Service contains data on millions of compounds. 


with Leadscope. “I was shocked the ACS was 
being predatory,’ he comments. “I felt kind of 
ashamed to be an ACS member and for the ACS 
to be doing this.” Court documents state that in 
2002, Leadscope lost an important venture-cap- 
ital deal because the ACS lawsuit scared off the 
investor, and that a financing deal the company 
later obtained was on less-favourable terms. 

Apodaca says that, like any organization, 
the ACS has the right to defend its intellectual 
property, and that it is increasingly common 
for large software companies to pursue com- 
petitors who they believe may have stolen their 
products. But in this case, the court found that 
the ACS had no evidence of Leadscope copy- 
ing source code from any software developed 
at CAS. Experts from Leadscope and the ACS, 
who had compared the source codes for Lead- 
scope’s product and PathFinder line-by-line, 
both testified in court that the products were 
completely different. The Supreme Court 
ruling says that although PathFinder and the 
Leadscope product perform similar functions, 
the idea of developing such tools could not be 
considered a trade secret of the ACS. 

The court rulings in favour of Leadscope 
emphasized the actions of Robert Massie, a 
lawyer and businessman who has been presi- 
dent of CAS since 1992. The court referred to 
evidence suggesting that before the ACS filed 
its lawsuit, Massie had been monitoring Lead- 
scope closely and was aware that it was run- 
ning out of money. Further evidence showed 
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that he had e-mailed the governor of 
Ohio's office to try to dissuade him 
from visiting Leadscope; and that 
the then-director of information 
technology for CAS had told Massie 
that he simply did not know whether 
Leadscope’s product infringed the 
ACS’s intellectual property. Massie 
did not respond to an e-mail from 
Nature drawing his attention to the 
court’s remarks and asking for com- 
ment. 

ACS v. Leadscope is the most recent 
example of the society’s efforts to pro- 
tect its intellectual property. In 2005, 
the ACS accused the US National 
Institutes of Health of unfair competi- 
tion in pioneering PubChem, a data- 
base linking chemical compounds 
with their biological activity, although 
that dispute has never reached court 
(see Nature 435, 718-719; 2005). In 2004, the 
society sued Google, claiming that the name 
(but not the content) of its free Google Scholar 
service infringed on the ACS’s trademarked 
SciFinder Scholar software. That case was set- 
tled out-of-court in 2006 on confidential terms. 

Donald Lewis, a patent lawyer based in San 
Diego, California, who isa member of the ACS 
Division of Chemistry and the Law, thinks that 
the society's pursuit of Leadscope runs counter 
to its goal to advance chemistry — something 
on which the society’s status as a non-profit, 
non-tax-paying organization depends. By 
developing a chemical-information product 
that is useful to researchers, Leadscope was 
arguably doing its bit to improve chemistry 
too, Lewis suggests. “In my opinion, the Lead- 
scope case illustrates an instance where the 
ACS’s business interests were pursued by its 
professional management to the detriment 
of the advancement of chemical sciences,’ he 
says. “I’m interested in the soul of the ACS” 

“In my view, this is not a fair representation 
of our members’ opinions,’ Madeleine Jacobs, 
chief executive of the ACS, wrote in an e-mail 
to Nature. She declined to comment on the 
recent court ruling, but emphasized that the 
society spends its revenues on fulfilling its 
mission through programmes to improve 
chemists’ qualifications and the scientific 
literacy of the public, and by providing 
“the most accurate, most trusted, and most 
in-depth chemical information in the world”. m 


| MORE NEWS | 
Custom @ African neighbours divided by their How to 
gene editing | genes go.nature.com/pswajh pusha 
rewrites @ Amazon fire analysis hits new moral 
zebrafish heights go.nature.com/siw3jn compass 
DNA @ Bacteria replicate close to the off course 
go.nature.com/ physical limit of efficiency go.nature.com/ go.nature.com/ 
nmhspe fqyd6k allnf 
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TOXICOLOGY 


Rat study sparks GM furore 


Cancer claims put herbicide-resistant transgenic maize in the spotlight. 


BY DECLAN BUTLER 


urope has never been particularly fond 
of genetically modified (GM) foods, but 


a startling research paper published last 
week looks set to harden public and political 
opposition even further, despite a torrent of 
scepticism from scientists about the work. 

The study’, published in the peer-reviewed 
journal Food and Chemical Toxi- 
cology, looked for adverse health 
effects in rats fed NK603 maize 
(corn), developed by biotech com- 
pany Monsanto to resist the herbi- 
cide glyphosate and approved for 
animal and human consumption 
in the European Union, United 
States and other countries. It 
reported that the rats developed 
higher levels of cancers, had larger 
cancerous tumours and died 
earlier than controls. The research- 
ers have not conclusively identified 
a mechanism for the effect. 

The rats were monitored for two 
years (almost their whole lifespan), 
making this the first long-term 
study of maize containing these 
specific genes. About a dozen long- 
term studies of different GM crops 
have failed to find such stark health 
effects”. An earlier test of NK603 maize in rats 
in a 90-day feeding trial — the current regula- 
tory norm — sponsored by Monsanto showed 
no adverse effects’. 

The explosion of media coverage about the 
findings has energized opponents of GM food, 
especially in Europe. French Prime Minister 
Jean-Marc Ayrault said that, if the results are 
confirmed, the government will press for a 
Europe-wide ban on the maize. The European 
Commission has instructed the independ- 
ent European Food Safety Agency (EFSA) in 
Parma, Italy, to assess the study. 

Many scientists, however, have already ques- 
tioned the study’s methodology and findings. 
They assert that the data presented in the paper 
do not readily allow the claims to be indepen- 
dently assessed, and they question the study’s 
experimental design and its statistical analysis 
of any differences between the treated groups 
and controls. Other scientists point out that 
the Sprague-Dawley strain of rats used in the 
experiments has been shown to be susceptible 
to developing tumours spontaneously, particu- 
larly as they grow older, making it difficult to 
interpret the results. Monsanto itself said that 


the study “does not meet minimum acceptable 
standards for this type of scientific research”. 
The €3.2-million (US$4.1-million) study 
was led by Gilles-Eric Séralini, a molecular 
biologist at the University of Caen, France, in 
collaboration with the Paris-based Committee 
for Research and Independent Information on 
Genetic Engineering (CRIIGEN), whose sci- 
entific board he heads. CRIIGEN bills itself as 


Tumours developed more readily in rats fed genetically modified maize than 
in controls, recent research reports. 


an “independent non-profit organization of 
scientific counter-expertise to study GMOs, 
pesticides and impacts of pollutants on health 
and environment, and to develop non pol- 
luting alternatives”. The article's publication 
coincides with the launch this week of a book 
by Séralini, Tous Cobayes? (All of Us Guinea- 
Pigs Now?), which tells the story of the research 
project and is accompanied by a film and a 
television documentary. 

In a written response to Nature’s questions, 
Séralini and Joél Spiroux de Vendémois, 
president of CRIIGEN and a co-author of the 
paper, say that they have been surprised by the 
“violence” and immediacy of scientists’ criti- 
cisms. They argue that most of the critics are 
not toxicologists, and suggest that some may 
have competing interests, including working 
to develop transgenic crops. They also point 
out some errors by critics, such as claims that 
graphs in the paper showing rat survival over 
time do not include data for the controls. 

The authors concede that Sprague-Dawley 
rats may not be the best model for such long- 
term studies, but argue that the difference 
between the NK603-fed rats and controls is 
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marked, and that many fewer control rats devel- 
oped tumours in middle age. The 90-day trial of 
Monsanto's NK603 maize used in its authoriza- 
tion also used Sprague-Dawley rats, they add. 
José Domingo, a toxicologist at Rovira i Vir- 
gili University in Reus, Spain, and a managing 
editor of Food and Chemical Toxicology, says 
that the study raised no red flags during peer 
review. Domingo, who last year authored a 
critical review of safety assessments 
of GM plants’, has previously com- 
plained about the lack of independ- 
ent feeding studies of GM foods. 
The controversy over the find- 
ings is likely to be settled only after 
detailed analysis of the paper and its 
data, and replication of the experi- 
ments. But Séralini says he won't 
release his data until the raw data 
underpinning the authorization 
of NK603 in Europe are also made 
public. And he wants all the data 
to be assessed by an independent 
international committee, arguing 
that experts involved in the author- 
ization of the maize should not be 
involved. EFSA chief Catherine 
Geslain-Lanéelle disagreed, and 
said that her agency is well placed to 
assemble a multidisciplinary group 
to give an impartial assessment. 
Some scientists, however, have long ques- 
tioned whether such feeding studies are appro- 
priate for testing the safety of whole foods, says 
Peter Kearns, head of food safety, nanosafety 
and chemical accidents for the Organisation 
for Economic Co-operation and Develop- 
ment in Paris. They were designed for test- 
ing chemicals where precise doses of purified 
and well-characterized compounds can be 
administered, whereas compounds in foods 
are heterogeneous, and doses are difficult to 
control. Regulators rely mainly on more robust 
tests that compare the toxicological and nutri- 
tional profiles of GM foods with their non-GM 
counterparts to screen for potential concerns. 
Resolution of the debate over the safety 
of GM foods can come only from rigorous 
science clarifying the issues, Kearns adds. m 
SEE EDITORIAL P.474 
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China buys US sequencing firm 


BGI’s rescue of Complete Genomics will keep a valued technology afloat. 


BY MONYA BAKER 


ince it started sequencing genomes 
in 2009, the DNA services company 


Complete Genomics has garnered praise 
for its technology, but little commercial suc- 
cess. Researchers valued its ability to produce 
readouts of entire human genomes with mini- 
mal errors, but they feared for the future of the 
company, based in Mountain View, California, 
which lost US$72.3 million in 2011 alone. Now 
BGI, whose sequencing facility in Shenzhen, 
China, is the largest in the world, has tossed 
Complete Genomics a lifeline. 

On 17 September, BGI announced that it 
had agreed to merge with the struggling firm 
and would spend $117.6 million to buy up its 
shares. The rescue was welcomed not just by 
the company’s board of directors, but also by 
those who worry that diminishing choices is a 
problem for genomics. 

“Right now, Illumina would have a monopoly 


CAREFUL READING 
A 2011 test of DNA-sequencing methods compared technologies from Complete Genomics and Illumina. 
Single nucleotide variants Insertions and deletions 
Detected Validated* Identified Validated* 
In common 3,295,023 100% 215,382 93% 
Illumina only 345,100 13% 390,060 91% 
Complete Genomics only | 99,578 94% 206,461 100% 


* A small, random sample of variants was reanalysed by a more accurate but more expensive method. 


if Complete Genomics went under,” says 
Michael Snyder, director of the Stanford Center 
for Genomics and Personalized Medicine in 
Palo Alto, California. Instruments made by Illu- 
mina of San Diego, California, account for most 
of the world’s sequencing output. Maintaining 
competition in the industry will keep prices low 
and technologies improving, Snyder says. 

Like Complete Genomics, BGI sells sequenc- 
ing services rather than equipment or reagents, 
but it relies chiefly on instruments made by 


Illumina. Complete Genomics has its own 
instrument and software platform, which is spe- 
cialized for sequencing whole human genomes. 

Although comparing ever-changing plat- 
forms is never straightforward, a 2011 analy- 
sis of the same genome read by Complete 
Genomics and Illumina technology showed 
that although the former found fewer variants 
in the DNA sequence relative to a standard 
reference, it was more accurate (see ‘Care- 
ful reading’)’. In July, Complete Genomics > 


SOURCE: REF. 1 
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> described a technique that reduces error 
rates in sequenced genomes to | in 10 mil- 
lion base pairs and shrinks the amount of 
DNA required to generate a sequence’. 
The technique also distinguishes between 
DNA fragments that come from paternal 
and maternal versions of the same chromo- 
some. This can reveal, for example, whether 
mutations occur in one or both copies of a 
gene, and so whether an individual is at risk 
of developing a certain disease or is merely 
a carrier. 

“That's going to be a game changer in how 
whole-genome sequences are interpreted 
and analysed,” predicts Thomas Barber, a 
geneticist at Eli Lilly in Indianapolis, Indi- 
ana, who works with both BGI and Com- 
plete Genomics. BGI’s acquisition means 
that researchers will continue to benefit 
from such advances. Having the two com- 
panies under the same ownership could 
combine the strengths of both. 

Amanda Murphy, an analyst with equity 
investment firm William Blair in Chicago, 
Illinois, says that the market for Complete 
Genomics’ expertise is poised to grow. Most 
human sequencing has so far focused on 
exomes, the roughly 1.5% of the genome that 
codes for protein. But the recent ENCODE 
project® and other research is revealing 
functions of non-protein-coding regions. 
“I think the world is moving towards 
whole-genome sequencing,’ she says. “The 
demand just wasn't there fast enough” for 
Complete Genomics to hang in on its own. 

Demand for services is increasing as well. 
Many researchers want to use sequenc- 
ing data but do not want to invest in the 
expertise and instruments needed to do 
the sequencing themselves. Even large aca- 
demic sequencing centres often have more 
sequencing projects than capacity. 

A major drawback of Complete Genom- 
ics’ technology is that it takes 2 to 3 months 
to produce a sequence, which discourages 
medical uses. But the platform is expected 
to get faster. Peter van der Spek, head of bio- 
informatics at Erasmus Medical Center in 
Rotterdam, the Netherlands, worries that 
innovation will stall under BGI, although 
a spokesman for BGI says that Complete 
Genomics’ current scientific staff is expected 
to stay with the company and that develop- 
ment would continue. 

Either way, the community is better off 
with options, says Richard Gibbs, director 
of the Human Genome Sequencing Center 
at Baylor College of Medicine in Houston, 
Texas. “If we just had one test and one 
machine and one process, things would 
happen, but not as fast.” = 
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Open-access deal for 
particle physics 


Consortium brokers agreement with 12 journals. 


BY RICHARD VAN NOORDEN 


r | he entire field of particle physics is set 
to switch to open-access publishing, a 
milestone in the push to make research 

results freely available to readers. 

Particle physics is already a paragon of open- 
ness, with most papers posted on the preprint 
server arXiv. But peer-reviewed versions are 
still published in subscription journals, and 
publishers and research consortia at facilities 
such as the Large Hadron Collider (LHC) have 
previously had to strike piecemeal deals to free 
up a few hundred articles. 

After six years of negotiation, the Sponsoring 
Consortium for Open Access Publishing in Par- 
ticle Physics (SCOAP?”) is now close to ensuring 
that nearly all particle-physics articles — about 
7,000 publications last year — are made imme- 
diately free on journal websites. Upfront pay- 
ments from libraries will fund the access. 

So that individual research groups do not 
need to arrange open publication of their work, 
the consortium has negotiated contracts with 
12 journals that would make 90% of high- 
energy-physics papers published from 2014 
onwards free to read, says Salvatore Mele, 
who leads the project from CERN, Europe's 
high-energy physics laboratory near Geneva, 
Switzerland, and home of the LHC. According 
to details announced on 21 September, six of 
the journals will switch their business models 
entirely from subscription to open access. It is 
“the most systematic attempt to convert all the 
journals in a given field to open access’, says 
Peter Suber, a philosopher at Earlham Col- 
lege in Richmond, Indiana, and a proponent 
of open access. 

The consortium invited journals to bid for 
three-year open-access publishing contracts, 
and ranked them by an undisclosed algorithm 
that weighed their fees against their impact 
factors and the licences and delivery formats 
they offer. Under the deal, the journals will 
receive an average of €1,200 (US$1,550) per 
paper. Physical Review D, the journal that pub- 
lishes most papers in the field, negotiated a fee 
of US$1,900 per article “on the principle that 
we should maintain our 


revenue’, says Joe Serene, NATURE.COM 
treasurer and publisher _ For acomplete list 
at the American Physical _ of participating 
Society, which owns the _ journals see. 
journal. But the society’s —_go.nature.com/3yvpzq 
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prestigious Physical Review Letters missed out 
because its bid was too high, says Serene (the 
journal currently charges authors $2,700 for 
individual open-access articles). CERN and 
SCOAP? will continue to negotiate individual 
open-access agreements with journals not 
included in the deal, and more could join when 
the contract is renegotiated in 2016. 

Mele says that the goal of SCOAP’ is to 
switch the discipline’s journals to open access 
without researchers noticing any effect on 
their grant funding or on the way they publish 
papers. The consortium will pay the contracts 
from an annual budget of €10 million, which is 
funded not by authors or research grants, but 
by pledges from more than a thousand librar- 
ies, funding agencies 


“It is the most and research consor- 
systematic tia across the world. 
attempt to In effect, existing 
convert all the journal subscription 
journals ina fees are being repur- 
given field to posed to provide the 


open access.” open-access funds. 

Before any con- 
tracts can be signed, however, publishers 
must reduce the price of their subscription 
packages to offset the income from SCOAP” 
— a complex calculation to ensure that 
libraries don't pay twice for the same con- 
tent. Then SCOAP’ must collect its pledges 
— nota foregone conclusion, as some librar- 
ies may be tempted to renege, figuring that 
their institution won't lose access to the free 
papers anyway. 

Mele hopes that success could trigger a 
domino effect in fields such as astronomy and 
astrophysics. “I personally believe that once 
this is demonstrated to work, some variations, 
fine-tuning and adaptation of the idea will 
emerge,’ he says. 

But Serene and others caution that SCOAP” 
may be hard to replicate. It has unique advan- 
tages in that most high-energy-physics papers 
are published in just a few journals, and that 
the field can be driven and coordinated by one 
central organization, CERN. 

Suber notes the stark contrast between the 
quiet brokering of SCOAP” and the battles 
playing out over mandates for open-access 
publication by research funders such as foun- 
dations and government agencies (see Nature 
486, 302-303; 2012). “I call it the peaceful 
revolution,” he says. m 


Political 


science 


cience and politics are uneasy bed- 
fellows. The first is built on evidence and 
objectivity; the second thrives on opinion and 
persuasion. Nowhere is that relationship more 
fraught than in the United States, where the 
need to win votes can trump scientific evi- 
dence on issues such as climate change and 
public health — and where scientists may 
have little sympathy for political give and 
take. This week, Nature scrutinizes the inter- 
section of politics and science in the run-up to 
the US election on Tuesday 6 November. With 
the presidency, all 435 spots in the House of 
Representatives and 33 of the 100 seats in the 
Senate at stake, the outcome could change the 
course of US science for the next four years. 
When Barack Obama swept into office, he 
pledged to make science a guiding tenet of his 
leadership. A News Feature on page 488 exam- 
ines how that pledge has fared in the face of 
hard political realities and crises such as the 


Deepwater Horizon oil spill. 

A pair of Comment pieces examines the 
relationship between scientists and Congress. 
On page 494, Lawrence Goldstein, a stem-cell 
biologist at the University of California, San 
Diego, urges researchers to hound their con- 
gressional representatives to make the case for 
funding science. And on page 493, Rush Holt, 
a physicist and Democratic congressman 
from New Jersey, says that Congress would 
function better if more of his colleagues 
thought like scientists, or sought their advice. 
Such interactions would leave both politi- 
cians and scientists better informed — and 
the relationship between their fields a little 
less fraught. m SEE EDITORIAL P.473 
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BY JEFF TOLLEFSON 


Nearly four years after US President Barack Obama pledged to put 
science in its rightful place, Nature asks if he kept his word. 


n 15 December 2008, president-elect 
O Barack Obama made clear to the world 

that science would have a central seat 
in his administration. At a press conference in 
Chicago, Obama introduced Nobel laureate 
Steven Chu as the next secretary of the energy 
department and the person who would help to 
wean the country off its addiction to climate- 
warming fossil fuels. “His appointment should 
send a signal to all, that my administration will 
value science,’ Obama said. 

Within days, he announced other mem- 
bers of his future staff, who would make up 
a star-studded science team (see “The science 
dream team’): marine ecologist Jane Lubchenco 


would head the National Oceanographic and 
Atmospheric Administration in Washing- 
ton DC and physicist John Holdren would be 
Obama's science adviser and head the Office 
of Science and Technology Policy, also in 
Washington DC. They joined Lisa Jackson, 
a respected chemical engineer with political 
experience, who had been named to run the 
US Environmental Protection Agency (EPA) 
in Washington DC. After taking office, the 
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president completed the team by appointing 
geneticist Francis Collins at the National Insti- 
tutes of Health (NIH) in Bethesda, Maryland, 
and geophysicist Marcia McNutt at the US Geo- 
logical Survey in Reston, Virginia. Never before 
hada president assembled such a strong crop of 
researchers to lead his science agencies. 

“The truth is that promoting science isn’t 
just about providing resources — it’s about 
protecting free and open inquiry,’ Obama pro- 
claimed as he made the initial appointments. 
“Tt's about listening to what our scientists have 
to say, even when it’s inconvenient — especially 
when it’s inconvenient.” 

Scientists and environmentalists swooned; 
they had spent 8 years complaining that the 
administration of President George W. Bush had 
overly politicized science. Climate researchers 
in government had charged that they were being 
muzzled and that their data were being manip- 
ulated. Pollution regulations were blocked or 
watered down. With Obama’ election, scientists 
would finally have a president who not only said 
the right things but actually appointed the right 
people. Even journalists drooled. “Science Born 
Again in the White House, and Not a Moment 
Too Soon,’ reada headline in Wired magazine, 
endorsing Obama's appointments with a swipe 
at Bush's reputation as a born-again Christian. 

The love affair would soon cool, however, 
as the Obama administration started to hit a 
number of obstacles while trying to govern a 
politically fractured nation in the midst of the 
worst economic crisis in 70 years. The president 
has not fulfilled some of his top science-related 


M. RILEY/POLARIS/EYEVINE 


promises, such as passing climate legislation 
to reduce the nation’s emissions of greenhouse 
gases. He has paid relatively little attention 
to NASA and the NIH, and got into bruising 
budget wars with Congress that sapped support 
for some science agencies. And his vaunted 
team stumbled in its response to the Deepwater 
Horizon oil spill in the Gulf of Mexico, which 
damaged the administration’s credibility with 
some researchers. 

But as Obama and his science team round out 
their first term in office and make a bid for a sec- 
ond, they can point to substantial achievements, 
some of them little noticed. Even in fiscally tight 
times, Obama has invested heavily in science 
education and research, particularly in energy. 
His administration has also made headway in 
tackling pollution, in part by introducing the 
country’s first greenhouse-gas regulations. And 
by driving the creation of integrity policies that 
seek to protect scientists from political inter- 
ference (see ‘Integrity test’), his team has sent 
positive signals to agencies that had become 
demoralized during the Bush years. 

“The president never let up in his consistent 
support for science, and actually he got a lot 
done in spite of the Republican resistance,’ says 
Neal Lane, who was science adviser to former 
President Bill Clinton and is now a professor 
at Rice University in Houston, Texas. 


STIMULATING SCIENCE 

Within a week of the election in November 
2008, and with the economy in free fall, 
Obama's advisers started working with the 
scientific community to survey ‘shovel-ready’ 
projects for potential inclusion in a stimulus 
package intended to boost construction and 
get people back to work. They initially aimed 
for US$5 billion in initiatives, but House Dem- 
ocrats doubled that in a draft of the stimulus 
bill released on 15 January 2009, five days 
before Obama's inauguration. And the role of 
science and innovation continued to grow. 

On 17 February, exactly 4 weeks into office, 
Obama signed a $787-billion stimulus bill that 
contained at least $53 billion for science. The 
bill made good on Obama’ promises to advance 
basic and applied research and development 
aimed at the major problems of the day, includ- 
ing clean energy and global warming. It boosted 
research funding by $2 billion at the National 
Science Foundation in Arlington, Virginia, and 
by $8.2 billion at the NIH. As he signed the bill 
at the Denver Museum of Nature & Science in 
Colorado, Obama called it the biggest increase 
in the history of basic-research funding. 

“You would have to go back to the 1940s, 
when Harry Truman became president, to 
find an administration that was receptive to 
doing something really significant on scientific 
research straight out of the box,’ says Michael 
Lubell, who handles government affairs for the 
American Physical Society in Washington DC 
and was one of a trio of scientists who helped to 
compile the initial suggestions for the science 


stimulus package. “And I think part of it has to 
do with Obama himself. This guy likes science” 

In those early months, the science agenda 
continued to ride high. In April, Obama visited 
the National Academy of Sciences in Wash- 
ington DC and proposed a long-term expan- 
sion of funding for basic and applied research 
and development. When he submitted his 
budget for 2010, Obama fulfilled that prom- 
ise by including full funding for the America 
COMPETES Act, a stalled 2007 initiative that 
called for a doubling of the federal budget for 
physical sciences. He also increased funding 
for science and mathematics education. 
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legislation seeking to reduce US greenhouse-gas 
emissions by roughly 80% below 2005 levels by 
2050, leaving the Senate as the next big hurdle. 

But then the climate bill had to wait. Obama 
and his team wanted first to push a health-care 
overhaul through Congress that would tame 
rising costs and expand insurance to mil- 
lions of Americans. The plan was to deal with 
health care before Congress took its August 
break and then shepherd climate legislation 
through the Senate in time for Obama to take 
something concrete to the United Nations’ 
global-warming summit in Copenhagen 
in December. But the health-care initiative 


“The president never let up in his 
consistent support for science, and 
actually he got a lot done.” 


Bigger budgets were not the only things that 
were fuelling optimism. In March, Obama 
overturned Bushs restrictions on using federal 
funds to support research into human embry- 
onic stem cells, and other early moves by the 
administration thrilled energy and climate 
researchers. On 19 May 2009, the president 
invited the chief executives of ten of the world’s 
largest car manufacturers to the White House 
Rose Garden to announce a historic agree- 
ment to establish the first greenhouse-gas 
standards for US vehicles. For two decades, 
the companies had been fighting against 
attempts to make cars more efficient, but the 
economic crisis and new regulatory author- 
ity had given Obama some leverage over the 
industry. Tough regulations in California had 
also helped to make car makers more recep- 
tive to higher standards. Obama's team later 
brokered a pact with the automobile industry 
to nearly double the average fuel efficiency of 
cars by 2025, to around 23 kilometres per litre. 

“A lot of the credit goes to the president, who 
really persevered and insisted that this was 
going to be part of the package,’ says Kevin 
Knobloch, president of the Union of Con- 
cerned Scientists, an advocacy group based in 
Cambridge, Massachusetts. “He believed that 
technology was the key to saving the industry” 

The deal was part of a broader push by the 
White House to reduce emissions. In 2007, the 
Supreme Court had given the EPA the power 
to regulate greenhouse gases, but the Bush 
administration had declined to do so. When 
Jackson came in, she immediately went to 
work building up the regulatory system. 

This new-found authority extended beyond 
vehicles; in theory, the EPA could regulate 
greenhouse-gas emissions from any source, 
but neither the president nor Congress pre- 
ferred that route for cutting emissions. Instead, 
there were high hopes that Congress would act. 
In June, the House of Representatives took the 
first step and passed comprehensive climate 


proved divisive and time-consuming. 

Obama ended up flying into Copenhagen 
empty-handed. He pledged that the United 
States would reduce its emissions; but without 
the backing of law-makers at home, he could 
make no binding commitments. 

On Christmas Eve of 2009, the Senate finally 
passed the health-care legislation. It was a his- 
toric achievement, decades in the making, but 
it would come at a heavy political price. 


AN OILY MESS 

If the health-care bill demonstrated the 
administration’s skills with Congress, then the 
way it handled NASA in early 2010 revealed 
how easily relations could sour. When the pres- 
ident rolled out his budget request in February, 
it held a bitter surprise for congressional sup- 
porters of the space agency. On the list of 
projects to be eliminated was Constellation, 
a programme to develop massive rockets to 
return humans to the Moon. 

“This was a major policy pronouncement 
but it was revealed in a budget release,” says 
Scott Pace, director of the Space Policy Institute 
at George Washington University in Washing- 
ton DC. Normally, an administration prepares 
Congress for such a change — but Obama's 
sudden move led to what Pace calls a “bruis- 
ing, year-long fight” with lawmakers in both 
parties. Eventually, several parts of the Constel- 
lation programme were reinstated. But by then, 
NASA had become an agency adrift, left to the 
mercy of parochial interests in Congress. 

Human space flight and many other 
elements of NASA’ mission were never priori- 
ties of the Obama administration. In the 2013 
budget request, the agency’s astrophysics and 
planetary-science programmes lost 8% of their 
funding compared with 2008. Obama was 
more interested in fixing problems with his 
home planet, and boosted funding for NASA’ 
Earth-sciences programmes by 44% over the 
same period. 
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Early in his administration, Barack Obama named some stellar scientists to lead the agencies that oversee US civilian 
research and development activities. Here is how some of the scientific team have fared over the past 4 years. 


JOHN HOLDREN 


Office of Science and 
Technology Policy and 
president’s science adviser 


s *Less than 2 months 
after taking office, Obama 
asked him to provide 
agencies with guidance 
on developing scientific- 


integrity policies within 
120 days. 


+ After a long delay, he 
finally issues guidelines 
at the end of year and 
agencies begin crafting 
policies. 


C} + Gets into fight with 
Congress after he met 
with Chinese officials, 
despite a congressional 
ban. Congress slashes 
budget of his office by 
one-third. 


» + By September, 
federal agencies have 
produced draft or final 
integrity guidelines. 


Helped to raise profile 
of science and science 
integrity but could have 


STEVEN CHU 


Department of Energy 


ys * Convinces Congress 
to provide funding for 
high-risk energy research 
programme, ARPA-E, and 
for small energy hubs for 
innovative research. 


» » Launches hubs 
that focus on computer 
simulation for nuclear 
reactors; fuels from 
sunlight; and energy- 
efficient buildings. 


) * Comes under fire 
from Congress after 
Solyndra, a solar-energy 
company that had 
received more than 
$500 million from the 
energy department, goes 
bankrupt. 


»y * Department starts 
work on two new hubs, 
focusing on energy 
storage and critical 


materials. 


Tried to overhaul 
bureaucracy while 
invigorating and focusing 


JANE LUBCHENCO 


National Oceanographic 
and Atmospheric 
Administration 


Sy) 


) « Alienates scientists 
over disputes about 
amount and fate of oil 
spilled in Deepwater 
Horizon accident. 


) * Congress kills plan to 


create a National Climate 
Service, one of her main 
goals. 


co) * Withdraws plan 

to appoint chief 
scientist after losing 
battle with Congress 
over administration’s 
response to oil spill; new 
ocean policy encounters 
troubles on Capitol Hill. 


Despite stumbles during 
the oil spill, Lubchenco 
pushed a solid scientific 


FRANCIS COLLINS 


National Institutes of Health 


» * Approves first new 
human embryonic-stem- 
cell lines made available 
to agency grantees under 
Obama's liberalized 
policy. 


» * Goes public with 
plans for a translational- 
research centre. 


ie) + Agency’s grant 
application success 
rates fall to historic low 
of 18%. 


+ Translational-research 
centre operates for 

9 months without a 
permanent leader. 
Christopher Austin 

is named head in 
September. 


Oversaw massive budget 
infusion early on, but 
funding picture turned 


LISA JACKSON 


Environmental Protection 
Agency 


» + Announces 
agreement with car 
makers to establish 

first greenhouse-gas 
standards for US vehicles. 


» «Issues finding 
that greenhouse-gas 
emissions threaten 
public health. 


» * Works with 
transportation 
department to finalize 
greenhouse-gas and fuel- 
efficiency standards for 
vehicles. 


» * Issues rule to reduce 
emissions of mercury and 
other toxins from power 
plants, one of multiple 
regulations that has 
made coal-fired power 
plants less competitive 
compared with natural 
gas and renewables. 


ys * Proposes rule to 
set greenhouse-gas 
emissions standards for 
new power plants. 


e * Court overturns rule 
to limit amount of pollution 
from power plants that 
crosses into other states. 


Tightened pollution controls 
and laid the groundwork 
for regulating greenhouse- 


done more to push for 
better policies across the 
federal government. 


research and development 
activities at the energy 
department. 


agenda but encountered 
opposition in Congress to 
marine policies. 


grim, leading to widespread 
unhappiness among 
biomedical researchers. 


gas emissions, despite an 
openly hostile environment 
on Capitol Hill. 
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INTEGRITY TEST 


A drive to put science above politics 
has hit some rough patches. 


Two months into his new job, President Barack Obama gathered a group of scientists at the 
White House to sign a memorandum on scientific integrity that declared “Science and the 
scientific process must inform and guide decisions of my Administration”. The 2009 statement 
promised that “political officials should not suppress or alter scientific or technological findings 
and conclusions”. For US researchers, the agreement came as a welcome change from the 
administration of George W. Bush, which had frequently been accused of infringing on science. 

Watchdog groups that track scientific integrity say that Obama’s administration has 
generally kept those promises — with some notable exceptions. Government scientists 
are reporting less political interference and more freedom to speak publicly than they 
experienced under the previous administration. “Agencies change slowly, but if they can 
change slowly into this culture of transparency, then we can win,’ says Francesca Grifo of the 
Union of Concerned Scientists in Cambridge, Massachusetts, who has spent time working 
with staff at federal agencies to develop integrity policies. 

Even so, the transition has taken much longer than anticipated. Obama’s science 
adviser, John Holdren, was supposed to issue guidelines for agencies within 120 days of 
Obama signing the memo, but it took nearly two years. Now, however, all US government 
agencies have either final or draft policies on scientific integrity, says Rick Weiss, chief of 
communications for the White House Office of Science and Technology Policy, which Holdren 
heads. Many of the policies that have been released explicitly forbid agency leadership from 


tampering with scientific results. 


Still, there have been lapses, charge critics. In December 2011, the US Food and Drug 
Administration (FDA) concluded that the morning-after contraceptive pill, Plan B One Step 
(levonorgestrel), should be made available to girls under the age of 17 without a prescription. 
But that decision was overruled by Health and Human Services Secretary Kathleen Sebelius. 
Obama said that he supported Sebelius’ decision and that “her judgement was that there was 
not enough evidence” that the youngest adolescents would be able to use the pill properly. 
Margaret Hamburg, the FDA commissioner, disputed that decision and stood up for agency 
scientists, who had determined there was sufficient evidence that younger teens were able to 


use the medication correctly and safely. 


Sebelius’s intervention was “deeply disturbing”, said Susan Wood, a health-policy expert 
at George Washington University in Washington DC and a former assistant commissioner 


or women's health at the FDA. “Once again the scientific and medical expertise has been 
overruled.” The FDA faced controversy again two months ago when agency managers were 
ound to have spied on the e-mails of five staff scientists. 

The April 2010 oil spill from BP’s Deepwater Horizon rig in the Gulf of Mexico also raised 
concerns about integrity. In May 2010, Marcia McNutt, director of the US Geological Survey 
in Reston, Virginia, wrote in an e-mail to scientists working to estimate the size of the spill 
hat the White House was trying to understate the numbers. That e-mail was obtained and 
released by the watchdog group Public Employees for Environmental Responsibility, based 
in Washington DC. McNutt’s comments echoed the concerns of many scientists, who had 
argued that government estimates of the oil spill were much too low. 


Grifo says that a major problem with translating policies on scientific integrity into action 
has been a lack of commitment by agency leadership. But that is not the case across the 
board. Wood says, for instance, that Hamburg took a remarkable public stance by backing her 
agency’s scientists, even though she was overruled. “Her Plan B decision was a clear stand-up 
for both science and public health in the face of controversy. Good for her. You don’t often see 
that in senior political appointees,” she says. E.S.R.,J.T., M.W. 


his big ambitions. In December 2011, he and 
a few dozen colleagues gathered for beers at 
the Rock Bottom brewery in Bethesda to cele- 
brate one of the biggest changes at the agency 
in a generation. Collins and his team had suc- 
ceeded in creating the National Center for 
Advancing Translational Sciences (NCATS), 
providing the administration with another 
victory in applied research. 


Collins had proposed NCATS a year earlier, 
to catalyse the ailing process of drug devel- 
opment by attacking bottlenecks in clinical 
trials, toxicology research and other areas. 
Although his plan hit some resistance in the 
NIH, in industry and on Capitol Hill, Col- 
lins managed to convince key members of 
Congress to support the shifting of funds 
within the NIH to create the $575-million 
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centre, and it opened in the final days of 2011. 

Some critics question the centre’s mission. 
At a congressional hearing in March, Roy 
Vagelos, former chief executive of drug-maker 
Merck, asked whether anyone believed that 
NCATS would be able to solve problems that 
are stumping the pharmaceutical industry. “If 
you believe that, you believe in fairies, he said. 

In May, Collins announced the first fruits of 
the centre. Standing with the research chiefs 
of three top pharmaceutical companies, he 
unveiled a $20-million effort to resurrect 
drugs that had passed safety trials but had been 
shelved by industry for business reasons or 
because they did not work for specific condi- 
tions. Under the agreement, the companies gave 
NIH-funded scientists a stab at repurposing 
those compounds. “The Obama administration 
is all about innovation,’ says Collins. “And that’s 
very much what NCATS means to do” 

As the election nears, Obama’s science team 
is racing to finish up its work. On 28 August, 
the EPA and the transportation department 
finalized the changes in vehicle standards that 
Obama initiated in the rose garden with car 
makers more than three years ago. In the inter- 
vening years, the EPA has moved forward with 
its Supreme Court authority and begun to lay 
the groundwork for a broad array of climate 
regulations. In March, it proposed a rule that 
would set emissions standards for new power 
plants and effectively ban coal plants unless 
they capture and bury carbon dioxide. 

Looking back over the past four years, 
Holdren says that “President Obama has made 
an unprecedented commitment to science, 
technology and innovation. ... He promised on 
inauguration day to ‘restore science to its right- 
ful place’ — a promise he has kept in spades.” 

But even his supporters acknowledge that 
the president did not achieve some of his 
biggest science-related goals. Carol Browner 
served as Obama's climate and energy adviser 
during the first two years and led the admin- 
istration’s push to pass climate legislation. 
“There's the disappointment of not getting 
legislation,’ she says. “But we didn’t just sit on 
our hands?” 

In his speech to the Democratic Convention 
on 6 September, Obama laid out some of his 
energy goals, should the country extend his 
stay in the White House. He talked about 
further reducing oil imports and advancing 
natural-gas production. He discussed improv- 
ing energy efficiency and advancing clean, 
renewable energies. “And yes, my plan will 
continue to reduce the carbon pollution that 
is heating our planet because climate change 
is not a hoax,” he said. But in sharp contrast to 
the soaring rhetoric and bold plans of 2008, he 
didn’t make any big promises. m SEEEDITORIALP.473 


Jeff Tollefson covers energy and environment 
for Nature in New York. Additional reporting 
by Eric Hand, Meredith Wadman and Eugenie 
Samuel Reich. 


POLARIS/EYEVINE 


Scientists can Evidence that Richard Holmes When Bayesian 
improve government by helpful colleagues improve celebrates the science- statistics can help to sort 
wooing their lawmakers p.494 others’ output p.496 biography revival p.498 signal from noise p.501 


US Representatives Rush Holt and Nancy Pelosi tour a magnetic resonance imaging facility. 


Politicians should 
think like scientists 


Amore rigorous, analytical and far-sighted approach 
would improve the US political process, says Rush Holt. 


efore I was elected to the US Congress 
B: 1998, I taught and researched 

topics such as solar spectroscopy and 
plasma physics. This background inspired 
some of my constituents to make bumper 
stickers that read: “My congressman IS a 
rocket scientist!” The residents of central 
New Jersey seemed to be calling for more 
science, or at least more scientific thinking, 
in Congress. 


Iagree. Scientists, engineers and technolo- 
gists are not necessarily smarter or wiser than 
others, but we have many habitual practices of 
mind that would be valuable in the sluggish 
legislative process. ‘Scientific thinkers’ — and 
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to be clear, not all such thinkers are profes- 
sional scientists — have a deep appreciation 
for evidence. They have a realistic under- 
standing of technology's promises and pitfalls. 
They work comfortably with estimates and 
data. They use statistical reasoning. They 
are more alert to the mental tricks that they, 
like all humans, play on themselves. Most 
importantly, they understand that the path 
towards good solutions is paved with 
uncertainty, trial and error; that conclusions 
should be tentative; and that alternative 
views should be entertained. 

Although the legislative branch rarely 
deals with matters involving, say, stellar 
chromospheres or the radio-frequency 
heating of ions, it tackles many issues in 
which scientific thinking does (or should) 
come into play. For example, after the 
vote-counting controversy of the 2000 US 
presidential election, I saw a difference in 
approach between scientists and others 
as Congress moved to update our nation’s 
voting technology. 

Lawmakers and election officials around 
the United States — most of them non- 
scientists — quickly embraced the idea 
of electronic voting machines, and Con- 
gress provided more than US$3 billion to 
help states to convert their voting systems. 
According to advocates, electronic machines 
would eliminate incompletely punched bal- 
lots (hanging chads’) that could misrep- 
resent voters’ intentions; they would prevent 
people from voting for more than one person 
in the same race (‘over-voting’) or neglecting 
to cast ballots in other races (‘under-voting’); 
and they would produce a quick tally on 
election night. What could be better? 

Computer scientists were alarmed. Soft- 
ware, they pointed out, is prone to subtle 
errors. In the privacy of the voting booth, 
how will the voter know that her or his vote 
has been recorded as intended? And after 
the voter leaves the booth, how will election 
watchers ensure that each vote was recorded 
as the voter intended? A potentially even 
greater problem is that elections are tempt- 
ing targets for hackers. 

Most legislators, having little training in 
scientific thinking, either did not recog- 
nize these limitations or assumed that such 
problems could be pre-empted by testing 
software. The best solution — and one that 
Ihave proposed many times in Congress 
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— is to use a paper ballot marked by the 
voter or by a machine in view of the voter. 
Yet, today we still do not have a national 
standard for voter-verified, auditable ballots. 
Scientific thinkers are also more alert 
to the cognitive biases that can lead to 
irrational decisions. For example, politicians 
(like everyone else) are often guilty of ‘short- 
term-ism: the desire to enjoy rewards now 
rather than invest them for later. Yet govern- 
ment actions — infrastructure projects, say, 
or education programmes — play out over 
decades, long past the careers of individual 
lawmakers. Scientists are generally comfort- 
able thinking about processes on different 
timescales — of millions or even billions 
of years. Scientific thinking can thus build 
strong arguments for investment in roads, 
bridges, trains and laboratories that will not 
produce profits tomorrow but will pay off 
powerfully in the decades to come. 


NUMBERS GAME 
Many lawmakers are uncomfortable with 
statistical reasoning. Take surveillance, 
for example: the New York City Police 
Department, as part of its counterterrorism 
programme, has conducted surveillance of 
suspected Muslims at restaurants and stores 
and monitored student groups and mosques 
around the United States. When I spoke 
in opposition to these surveillance pro- 
grammes and called them ‘profiling; some 
of my non-scientist colleagues said that such 
profiling is keeping the United States safe, 
even if it is ethnically discriminatory. 

But my colleagues were disregarding the 
fact that terrorists are exceedingly rare. Since 


2001, Muslims have boarded planes in the 
United States perhaps 50 million times. If 
officials had screened each of these travellers 
using a protocol that could detect terrorists 
with, say, 99.9% accuracy, about 50,000 
people would have been wrongly accused 
of terrorism. Billions of dollars would have 
been wasted in profiling and detaining inno- 

cent people, creating 


“Scientific profound distrust 
thinking among targeted com- 
builds munities. 

arguments for Statistical reason- 
investments ing would lead one 
that will pay to recognize that this 
off powerfully money would be far 
inthe decades _ better devoted to on- 
to come.” the-ground intelli- 


gence gathering. 

Scientists are just more comfortable with 
uncertainty than non-scientists. This trait 
would come in handy for lawmakers, who 
often must take action despite conflicting 
evidence. For example, a failure to understand 
ordinary fluctuations in noisy climate data 
allows some members of Congress to believe 
that claims of human-induced climate change 
are a hoax, or that the data are so chaotic that 
no policy action can be devised. 

A similar discomfort with uncertainty is 
evident on the first Friday of each month, 
when politicians react to the labour 
department's monthly jobs report. Earlier 
this month, legislators (egged on, it should 
be said, by the news media) spent countless 
hours debating the implications of a report 
that businesses created 29,000 fewer jobs in 
August than economists expected. Yet few 


grasped that the survey’s margin of error 
was 100,000 jobs at the 90% confidence level. 
Those 29,000 jobs could very well have been a 
statistical blip rather than a real trend. 

How can we increase scientific thinking 
in Congress? One way, of course, is to elect 
more scientists — and I strongly encour- 
age scientists to consider seeking political 
office. But that is an unlikely solution. More 
non-scientists need to feel comfortable 
thinking like scientists. This is not with- 
out precedent — notall legislators hold law 
degrees, but all must be comfortable thinking 
like lawyers when drafting a bill or reading a 
statute. When I need to think like a lawyer, I 
turn to legal professionals for help. Congress 
has a staff of lawyers at the Office of Legis- 
lative Counsel, which helps members to turn 
their ideas into legislative language. Shouldn't 
legislators who have scientific questions be 
able to seek similar guidance from scientists 
and, over time, pick up more scientific ways 
of thinking? 

Until 1995, Congress had a dedicated 
staff of professional scientists in the Office 
of Technology Assessment. This office was 
eliminated in a round of foolish budget 
cuts. It should be re-established, and in the 
interim, we scientists should seek to make 
our voices heard in other ways: through 
meetings with members of Congress, 
letters to editors, town halls and other public 
forums. 


Rush Holt represents the 12th congressional 
district of New Jersey, West Windsor, 

New Jersey 08550, USA. 

e-mail: rush.holt@mail.house.gov 


Know your representatives 


Lawrence Goldstein urges researchers to talk to lawmakers about science. 


wenty years ago, after moving to San 
i California, I went to visit the 

US congressman for my new district. 
I was responding to the American Society 
for Cell Biology, which had encouraged 
its members to visit their congressional 
representatives and tell them why federal 
funding for the US National Institutes of 
Health (NIH) was important to the welfare 
of the United States. I had heard that this 
congressman was a former fighter pilot and 
very conservative, but nothing could have 
prepared me for our first meeting. 

As soon as we sat down, he told me that he 
thought that the NIH should be privatized. I 
was surprised, but being stubborn, I contin- 
ued to visit him once or twice a year for the 
next few years. Each time, I would explain 
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how biomedical research leads to a better 
understanding of basic biology and disease, 
and how this translates to new and successful 
approaches to disease therapy. Over time, his 
meetings with me and with other scientists 
and patient advocates in our community led 
to a dramatic change in his view on public 
funding for biomedical research. 

This conservative congressman became 
a member of a key appropriations subcom- 
mittee, on which he fought for substantial 
increases in NIH funding and even voted to 
loosen restrictions on funding for embryonic 
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stem-cell research. Personal interactions 
between scientists and legislators are crucial 
in today’s increasingly technological world, 
in which science policy and funding become 
more important with each passing year. 


GOOD INFLUENCE 
Some of my colleagues prefer not to interact 
with members of Congress — they think 
that one scientist's voice doesn't carry much 
weight amid the cacophony that assails most 
officials. Yes, the executive branch of the US 
government receives considerable formal 
advice on science policy’. But the president 
can get very little done without the support 
of Congress. 

Members of Congress determine the 
amount of money available for science, 


F. HARRISON/GETTY IMAGES 


and they often pass major laws affecting 
science policy. Many of them have never 
met a working scientist. It is here that ind- 
ividual scientists can have great personal 
and scientific impact. 

This is true at the state level as well. 
In California, individual scientists have 
influenced lawmakers in positive and 
important ways’. For example, following 
the moratorium on the use of human mat- 
erials in nuclear-transfer experiments after 
Dolly the sheep was cloned in 1997, Calif- 
ornia senators requested advice from me 
and other biomedical scientists about how 
to proceed with the technology. We travelled 
to the state capitol several times to speak to 
policy-makers and give briefings to mem- 
bers of the legislature. We helped them to 
understand the research issues and ramifi- 
cations of various legislative approaches to 
the issue. 

Ultimately, the legislators crafted laws 
that have served as a good model for how to 
approach stem-cell science in a thoughtful 
way. The laws created a regulatory system 
that restricts the few practices that all would 
agree are inappropriate, namely the implant- 
ation of any product of nuclear transfer using 
human materials. They also established a 
group to develop additional regulations in 
California. 

In 2003-04, another group of scientists 
(myself included) and patient advocates 
worked together to develop and secure 
voter approval for California’s landmark 
Proposition 71, which provides US$3 billion 
for stem-cell research over ten years in a 
competitive, peer-reviewed funding system. 
The initiative included requirements for 
ethics regulations and rigorous review of 
funding proposals, partly because of the 
advice my colleagues and I provided during 
drafting. 


BUILD BRIDGES 

Developing relationships with potential allies 
takes surprisingly little time and effort. A 
couple of visits a year to the capitol or a leg- 
islator’s local offices, 


reinforced by the “Providing 
occasional letter, can accurate 
goalong waytowards information 
building along-last- can have 

ing rapport. Scien- @ large 

tists will not always influence on 


make an impact 
on the first try; they 
must maintain contact and a consistent 
message over time. 

Some non-scientists have frustrating 
initial opinions based on incomplete data. 
My colleagues and I often encountered the 
mistaken belief that embryonic stem cells 
come from aborted fetuses. But providing 
accurate information while being respectful 
of people’ values and beliefs can have a large 


legislation.” 


California’s Proposition 71, passed with scientists’ help, provides US$3 billion for stem-cell research. 


influence on legislation. Patient advocates, in 
particular, bring a human touch that reminds 
legislators (and scientists) of the important 
consequences of their work. 

Scientists should follow a few simple prin- 
ciples when meeting with lawmakers. It is 
important to be friendly and informative, and 
to avoid jargon. People will not care about 
your message if they cannot understand it: 
for example, say ‘blood-forming’ instead 
of ‘haematopoietic. Also, focus on science 
during meetings; do not stray to other issues. 
To avoid seeming parochial, keep conversa- 
tions in the context of what is good for the 
nation and society at large; do not talk only 
about what is good for science and scientists. 
Bring promising statistics — such as estimates 
that every dollar the NIH spends on research 
creates $2.20 in local economic growth, or 
that gains in average lifespan due to scientific 
research have added $3.2 trillion a year to the 
US economy since 1970 (ref. 3). 

Do not focus solely on lawmakers. Interact 
also with their staff members, who are usually 
bright and motivated and can be relied on for 
information and informal communication 
throughout the year. Finally, remember that 
you do not have to be a well-known senior 
scientist to make an impact. An educated and 
concerned approach to scientific issues is the 
most important credential you can bring to 
a meeting’. 

Lawmakers need to hear about why 
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research matters. For example, the United 
States spends more than $200 billion a year 
caring for people with Alzheimer’s disease, 
yet the NIH and private foundations spent 
less than $500 million during the 2011 fiscal 
year to fight the disease. This 400:1 ratio is 
inadequate, even in a time of fiscal crisis. It 
is comparable to a family spending $10,000 
a year on a problem but only $25 a year on 
finding a solution. 

As scientists, it is a mistake for us to say 
we are too busy to reach out to lawmakers. If 
we do not try, science funding will continue 
to decrease in the coming years; lawmakers 
will enact restrictive policies that are not 
informed by the best scientific information 
available; and society will be poorer for our 
absence. m 


Lawrence Goldstein is Distinguished 
Professor in the Departments of Cellular 
and Molecular Medicine and Neurosciences 
at the University of California, San Diego, 
School of Medicine, La Jolla, California 
92093, USA. 

e-mail: lgoldstein@ucsd.edu 
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Honour the helpful 


Alexander Oettl presents evidence that scientists who share 
advice and expertise enhance their colleagues’ productivity. 


hat kind of scientist does a 
department want to hire? The 
investigator who churns out 


high-impact papers and travels around the 
world giving seminars? Or someone with 
an average publication record who is always 
discussing other people’s work with them, 
attending seminars and providing feedback 
on colleagues’ papers? 

Most departments would probably hire 
the first one and not think twice. I believe 
the choice is not so straightforward. Tradi- 
tionally, science has been an individual pur- 
suit, in which people were valued for their 
personal output and achievements. Discov- 
ery increasingly relies on teamwork and yet 
scientists are still judged only by what they 
themselves accomplish. 

Throughout my career I’ve had excellent 
colleagues. Even if they weren't participating 
directly in a project, they provided crucial 
advice that shaped an experiment, gave 
thoughtful criticisms of manuscripts and 
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shared data and expertise with anyone who 
asked. Such people are often thanked in the 
acknowledgments of papers, not listed as 
co-authors. But this quality — I call it ‘help- 
fulness’ — isn’t measured in job applications 
nor is it considered by hiring committees. 

It should be. By reviewing the acknowl- 
edgements in immunology papers since 
1950, I have found that when principal inves- 
tigators (PIs) who were frequently thanked 
by others died unexpectedly, the quality of 
the papers of their collaborators dropped. 
In contrast, the quality of co-authors’ work 
changed little after the passing away of 
PIs who were not particularly helpful 
to others. 

To me, this hints that we need to re- 
evaluate what makes some scientist ‘stars, to 
make room for those who may have fewer 
individual achievements but are a major 
source of support and feedback for the peo- 
ple around them. Especially nowadays, when 
it can seem harder than ever to spare time to 
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help someone else, such collegiality should 
be encouraged. It benefits the entire scientific 
enterprise. 


WITH THANKS 

I examined the acknowledgements from the 
Journal of Immunology — the society journal 
of the American Association of Immunolo- 
gists (impact factor 5.788) — from 1950 to 
2007 (more than 50,000 papers), using 
software to note who was thanked, and for 
what (A. Oettl Manage. Sci. 58, 1122-1140; 
2012). To find investigators who probably 
died while still actively working and publish- 
ing — and thus can act as a quasi-natural 
experiment — I extracted obituaries from 
more than 400,000 immunology articles 
from Web of Science and the American 
Association of Immunologists newsletter. I 
then looked for people who died within 50 
years after publishing their first paper, and 
checked that they and their co-authors were 
actively publishing up to the PI’s death. 


ILLUSTRATION BY OLIVER MUNDAY 


I found 149 PIs (either first or last authors) 
who had died in mid-career. Of these, 
63 were in the top 20% of people thanked 
in all acknowledgements for the year in at 
least one year of their publishing career. I 
call this group ‘helpful. Thirty-five of the 
PIs in this group were also highly productive, 
defined as being in the top 5% for the num- 
ber of annual citations and high-impact- 
factor immunology publications. I similarly 
divided the less helpful PIs into productive 
and less productive categories. 

I found that co-authors of helpful PIs 
experienced a similar decrease in the num- 
ber of papers published after the death 


WHEN HELP FADES 


of those PIs as did the co-authors of less 
helpful and less productive PIs (see ‘When 
help fades’). But co-authors of the helpful 
PIs experienced a distinct decline in the 
quality of their publications compared with 
the less helpful groups — as judged on the 
impact factors of the journals in which they 
published. Co-authors of the helpful and 
most productive PIs experienced a 22% 
decline (P< 0.01), whereas those of the 
helpful but less productive PIs saw a 20% 
decline. In some cases, the dip lasted for 
more than five years. 

Co-authors of highly productive and 
helpful PIs received 21% fewer citations, 
by a 2010 cut-off, for work published after 
the PI’s death. Citations dropped by 28% 
on average over the same time period for 
co-authors of helpful but not particularly 
productive Pls. 

In contrast, co-authors of the 17 PIs who 
were not particularly helpful but had pub- 
lished many papers in high-impact journals, 
did not experience any statistically signifi- 
cant decline in the quality or number of their 
publications or in the number of citations 
received after their colleague’s death. This 
may seem as if working with unhelpful col- 
leagues means that a scientist’s subsequent 
work is saved from being affected. But these 
results also suggest that had these co-authors 
worked with a helpful PI over the same 
period, their productivity might have been 
even greater. 

The impact of a death was particularly 
profound on co-authors of PIs who were 
helpful with conceptual feedback, such as 
advice and criticism. Perhaps this is because 
such services are harder to replace than 
technical expertise, reagents and other study 
materials. 

A good question is: if those helpful PIs 
with average publication records have such 
a positive impact on their colleagues’ experi- 
ments, why aren't they more successful 
themselves? But there are many instances in 
science where these qualities don’t go hand 
in hand — for example, many referees can 


After ‘helpful’ Pls die in mid-career, their co-authors experience a dramatic decline in high-impact 
publications and citations. In contrast, there is no significant change in these for co-authors of 
‘unhelpful’ Pls after their death (error bars indicate 95% confidence intervals). 
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have a substantive, positive effect on the 
quality of a manuscript even though they 
may not have been able to produce a work 
of similar quality themselves. 

Of course this study has limitations. It 
examines acknowledgements in just one 
journal within a single biomedical field. In 

addition, the deaths 


“Collegiality I record are by no 
should be means exhaustive, as 
encouraged. I must rely on second- 
Itb enefits ary data sources. Ide- 
the entire ally, I would examine 
scientific whether there is an 
wn terprise - increase in work qual- 
‘ ity after the formation 

ofa relationship with 


a helpful PI, rather than the effects of the 
loss of that benefit. But this would require 
an elaborate randomized field experiment 
involving forced co-authorship — hardly a 
realistic possibility. 


TEAM SPIRIT 

My results suggest that scientists who 
are helpful have a major impact on their 
colleagues’ careers — and have been under- 
valued by a scientific enterprise that rewards 
individual achievement above all else. 

Itis time to look more closely at what qual- 
ities we value most in scientists. Researchers 
who generate numerous high-impact papers 
may have little time to discuss problems, crit- 
icize manuscripts or mentor students. Those 
who produce a stream of average papers may 
have a much more positive impact on the 
careers of the people around them. Research- 
ers looking for collaborators may sometimes 
opt for a helpful colleague who is not a major 
force in their field over a rock-star scientist 
who rarely replies to e-mails. 

So hiring committees should look beyond 
an applicant's publication record. They should 
read the recommendations of peers and look 
for signs that the individual might influence 
departmental dynamics in a positive way. 

Perhaps we should even develop a metric 
that measures helpfulness, such as average 
acknowledgments per year? And funding 
agencies could give ‘helpfulness’ awards 
for generosity that has benefited the field at 
large, to encourage scientists to help others. 

Of course, not every scientist is able to be 
more helpful. For those who have trouble 
interacting with others, say, trying to change 
their natural inclinations and personal- 
ity will be a losing battle. But science does 
need to change its reward structure so that 
researchers who do a lot for others are not 
penalized. = 


Alexander Oettl is an assistant professor at 
the Georgia Institute of Technology Scheller 
College of Business, Atlanta, Georgia 30308, 
USA. 

e-mail: alex.oettl@scheller.gatech.edu 
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BOOKS & ARTS 


The history of science is about people such as Paul Dirac, as well as the discoveries they made. 


The scientist 
within 


Richard Holmes celebrates today’s revival of science 
biography, a tradition spanning 300 years. 


early two decades on from the 
| \ | explosion in popular-science pub- 
lishing, books by luminaries such as 
Stephen Hawking and Richard Dawkins are 
still selling in six figures, and authors such as 
Brian Cox and Rebecca Skloot are drawing 
a younger generation of readers. But Iam 
struck by a new emphasis — on popular- 
science biography. As I learned while 
researching my book The Age of Wonder 
(Harper, 2008), the “life scientific” can be 
an extraordinarily gripping adventure. 
There is a noble tradition of science biog- 
raphy stretching back to the 1600s. Recently, 
a fresh hunger to understand the making of 
science through the making of scientists has 
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emerged. We want to read about scientific 
work as part of a life story — to know what 
makes a scientist tick, and what set them 
ticking. We are intrigued to learn that Dawk- 
ins, the eminent biologist and atheist, had “a 
normal Anglican upbringing’, was inspired 
by the Dr Doolittle books and taught in 
1960s San Francisco, in the time of flower 
power and demonstrations against the war 
in Vietnam. 

The form is currently undergoing unprec- 
edented transformation. Biographers are 
tackling highly complex and challenging 
areas such as X-ray crystallography, general 
relativity and quantum physics, explaining 
them as pure adventures of the human spirit. 
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We see this in spellbinding biographies, from 
Georgina Ferry’s Dorothy Hodgkin (Granta; 
1998) and Walter Isaacson’s Einstein (Simon 
& Schuster, 2007) to Graham Farmelo’s 
award-winning biography of Paul Dirac, 
The Strangest Man (Faber and Faber, 2009) 
— which drew praise from readers as varied 
as Michael Frayn, Tom Stoppard and Martin 
Rees. 


THE EVOLUTION OF A THEORY 

A whole spectrum of books on Charles 
Darwin, published to mark the 2009 bicen- 
tenary of his birth, illuminated the debate 
surrounding the emergence and implica- 
tions of evolutionary theory. Led by Janet 
Browne, who treated On The Origin itself as 
a biographical subject, and Adrian Desmond 
and James Moore, who looked at Darwin's 
work on human origins, this biographic 
experimentation expanded in original ways. 
Rebecca Stott’s Darwin's Ghosts (Blooms- 
bury, 2012), for example, is a prequel consist- 
ing of vivid portraits of early evolutionists, 
starting with Aristotle doing marine biology 
on the isle of Lesbos. 

Stott’s book is part of a growing trend 
for group biographies of scientists. These 
use teamwork, competition and personal 
rivalry to reframe science as a kind of social 
history. Outstanding examples cover group- 
ings in every century. Arabic trailblazers 
of the ‘dark ages and people of the early 
medieval period star in Jim Al-Khalili’s 
Pathfinders (Allan Lane, 2010). Lisa Jar- 
dine’s Ingenious Pursuits (Little, Brown and 
Co., 1999) focuses on seventeenth-century 
European natural philosophers, and Jenny 
Uglow’s The Lunar Men (Faber and Faber, 
2002) displays the enlightened industrial- 
ists of the eighteenth century. We can relish 
the pioneering efforts of nineteenth-century 
palaeontologists in Deborah Cadbury’s The 
Dinosaur Hunters (Fourth Estate, 2000), 
and marvel at the groundbreaking work of 
twentieth-century physicists in Manjit 
Kumar’s Quantum (Icon Books, 2008). 

Mathematics could be seen as the hard- 
est nut to crack biographically because of its 
abstract language. Yet Simon Singh rendered 
it thrilling in his treasure-hunt of a group 
biography over four centuries, Fermat’s Last 
Theorem (Fourth Estate, 1997). 


PROFILE PIONEERS 

But what of that earlier tradition of science 
biography? The word ‘scientist’ was coined 
only in 1834, and there is a popular miscon- 
ception that individual scientific ‘lives’ — 
such as Henry Mayhew’s Young Humphry 
Davy (1855) — are 

essentially a Victorian 
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1680s, John Aubrey wrote Brief Lives of Wil- 
liam Harvey, Edmund Halley and Robert 
Boyle, as well as of Shakespeare and Milton. 

A surprising pioneer of the form in the 
eighteenth century was Samuel Johnson. 
Along with his dictionary and The Lives 
of the Most Eminent English Poets (1781), 
Johnson also wrote The Life of Dr Herman 
Boerhaave (1739), a vivid short biography 
of the great Dutch botanist and medical 
doctor who founded clinical teaching at 
Leiden, improved diagnostic techniques 
and isolated urea. 

Johnson perceived that Boerhaave’s child- 
hood was essential to his adult science. He 
writes, for instance, of Boerhaave's gruesome 
teenage leg ulcer: “his own pain taught him 
to compassionate others, and his experience 
of the inefficacy of the methods then in use, 
incited him to attempt the discovery of oth- 
ers more certain.” 

Similarly, William Stukeley in his Life of 
Newton (1752) is careful to cite “the extraor- 
dinary pregnancy of his genius, whilst a 
boy”. Stukeley gives vivid descriptions of 
Newton's boyish windmills, counter levers 
and water clocks. Stukeley also relates the 
first known version of the young Newton's 
famous vision of “universal gravitation” in 
the orchard at Woolsthorpe, Lincolnshire, 
as he contemplates the falling of an apple 
from a tree: “Why sh‘ it not go sideways, 
or upwards? but constantly to the earths 
centre? ... there must be a drawing power 
in matter. & the sum of the drawing power 
in the matter of the earth must be in the 
earths center, not in any side of the earth” 
(Stukeley’s original manuscript, one of the 
treasures of the Royal Society, can be read 
at http://royalsociety.org/library/moments/ 
newton-apple). 

The growing fascination with scientific 
biography in the nineteenth century is illus- 
trated by the supplements to the fourth, 
fifth and sixth editions of the Encyclopae- 
dia Britannica, completed in 1824. Of 165 
lives selected for individual treatment, 35% 
were broadly scientific. Another landmark 
was David Brewster’s The Life of Sir Isaac 
Newton (1831), which presented the natural 
philosopher as a secular saint, “the high- 
priest of science” and a man of universal 
genius. Like Stukeley, Brewster emphasized 
the originality of mind revealed by New- 
ton’s boyhood. Brewster also gave the idea 
of scientific childhood a new, metaphorical 
dimension. 

It was his superb final chapter that called 
attention to Newton’s now celebrated remark 
about the mysterious, shifting infinity of 
scientific truth: “...to myself I seem to have 
been only like a boy playing on the seashore, 
and diverting myself in now and then find- 
ing a smoother pebble or a prettier shell than 
ordinary, whilst the great Ocean of truth lay 
all undiscovered before me.” Newton may 


here have been adapting an image from 
Book IV of Milton’s Paradise Regained — 
the observation that “wise men” may think 
themselves “deep versed” in research, yet in 
reality may be “as children gath’ring pebbles 
on the shore’. 


SETBACKS AND SERENDIPITIES 

In what ways do biographies, whether writ- 
ten last year or in the seventeenth century, 
throw light on science and scientists? The 
Royal Society, announcing a historic sym- 
posium, Writing Scientific Biography, in 
2008, observed that such works “show us 
the human face of science: the motivations, 
set-backs, serendipities and moments of 
enlightenment in the quest for knowledge”. 

I think they do more. First, they show 
the crucial shaping power of childhood and 
youth, where the scientific vocation and 
vision begin to emerge. Second, they exam- 
ine the nature of the creative breakthrough. 
Eureka moments are only one manifesta- 
tion. Discovery may equally be the product 
of painful years of research, experiment and 
thought, as with Darwin or Dirac. Or it may 
be the result of a short but intense period 
of competition and rivalry, as with Francis 
Crick, James Watson, Rosalind Franklin 
and Linus Pauling, in the race to discover 
the structure of DNA. 

Third, biographies point to the impor- 
tance of the inner, imaginative and emo- 
tional life. They contradict the fallacy that 
all scientists are icy rationalists. The great 
US physicist Richard Feynman played 
the bongos. The famously elusive chem- 
ist Henry Cavendish was a secret member 
of the notorious Cat and Bagpipes Club. 
Michael Faraday was an elder of the Sande- 
manian church. James Clerk Maxwell wrote 
Scottish poetry and accompanied himself 
on the guitar. Alan Turing was an outstand- 
ing marathon runner. All these traits and 
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characteristics can be linked to the ener- 
gies that both inspired and occasionally 
impeded their research. 

Fourth, biographies show that error and 
uncertainty are central to discovery, ina 
way that is lost in ‘the literature} that offi- 
cial record of scientific endeavour. Victo- 
rian histories of science tended to propose 
models of unbroken, cumulative progress, 
which encouraged the alarming triumphal- 
ism of classical science in the early twenti- 
eth century. Contemporary ‘scientisnY has 
perhaps inherited something of this bias. 
By contrast, individual biography reveals 
doubt, scepticism and the historically shift- 

ing boundaries of 
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wig Wittgenstein and 
Bertrand Russell) 
reveals just such a powerful field of political, 
financial and ethical ambiguities surround- 
ing US science during the cold war. 

Biographies can also probe originality 
and disputed cases of priority, as in the “dis- 
covery” of oxygen by Antoine Lavoisier, or 
Joseph Priestley or Carl Scheele; or the the- 
ory of evolution by Darwin or Alfred Russel 
Wallace. They show that at particular times, 
certain areas of science are intensely active 
and open to discovery — currently, cosmol- 
ogy, neuroscience and nanotechnology — 
while others are comparatively dormant. It 
is in these active areas that the linked forces 
of teamwork and rivalry are most fiercely 
alive, and their consequences can be best 
understood. 

In The Strangest Man, for instance, Gra- 
ham Farmelo memorably compares the 
development of quantum mechanics in the 
1920s by a team of 50 physicists to “a group 
of construction workers” labouring on an 
enormous building site occupying much of 
northern Europe. “Virtually all the builders 
were male, under thirty, intensely competi- 
tive and craving the respect of their peers as 
well as the blessing of posterity” 

So science is always a story. A detective 
story, perhaps; a mystery story; a love story; 
even, on occasion, a ghost story (the Higgs 
boson?). But always a story of human lives. 
And that is what seems to fascinate us — 
again. 


creation.” 


Richard Holmes is the author of The 
Age of Wonder, which won the 2009 
Royal Society Prize for Science Books. 
His new book, Falling Upwards : How 
the Romantics Took to the Air, will be 
published in spring 2013. 

e-mail: richard.holmes@osb.me.uk 
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Digestive tracts 


Barbara Ketcham Wheaton samples a brace of 
food-related science and technology histories. 


mind. Now, two books open windows 
on the dynamic interplay of science, 
technology and the culinary arts in history. 
Emma Spary, in Eating the Enlightenment, 
explores concerns about physiology and 
morality vis-a-vis cuisine at the dawn of 
modern science. Meanwhile, food historian 
Bee Wilson's Consider the Fork delves into 
the chewy past of kitchen technology. 
Focusing on Paris during the tumultuous 
years from 1670 through to the mid-eight- 
eenth century, Spary examines questions and 
controversies surrounding the edible — from 
food and drink addictions to the dangers of 
mixing social classes at cafés. She shows how 
French thinkers, medics, clergymen and café 
proprietors probed matters digestive from 
scientific and philosophical standpoints. 
They asked many questions. What was the 
correct diet for an intellectual? Were the 
spirits in liqueurs a good thing? Were cafés 
suitable venues for the display of literary wit? 
Spary shows how scientists of the time 
debated whether we digest food by dissolv- 
ing it with acid in the ‘chemical laboratory’ 
of the stomach (the iatrochemical theory), 
or by mechanically grinding it with the teeth 
and then more finely with the stomach. The 
chemist Louis Leméry spoke for the iatro- 
chemists in his Traité des Aliments (1702) 
in postulating acid or alkaline fermentation 
of foodstuffs during digestion. In the 1730s, 
working separately, Antoine-René Ferchault 
de Réaumur validated the iatrochemical 
theory in experiments with living birds. 
Even Jacques de Vaucanson, inventor of 
automata, got in on the act by ‘demonstrating’ 
the process of chemical digestion through 
mechanical trickery. His famous Digesting 
Duck automaton appeared to eat, digest and 
excrete grain, although the ‘input and ‘output 
were purportedly in separate compartments. 


| oa has long exercised the scientific 
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Eating the Enlightenment: Food and the 
Sciences in Paris 

E. C. SPARY 

University of Chicago Press: 2012. 368 pp. $45, £29 


Consider the Fork: A History of How We Cook 
and Eat 

BEE WILSON 

Basic Books: 2012. 352 pp. $26.99 


A particularly heated debate swirled about 
whether an austere, ‘natural’ diet supported 
aclear mind and pure body better than did 
a luxurious, sophisticated one. Spary cites 
many of the influential cookbooks from 
both camps, including Marin’s Les Dons de 
Comus (1739) on the sumptuous side and 
Menon’s La Cuisiniére Bourgeoise (1746) on 
the natural one. 

Philosophers took sides, with Voltaire 
plumping for luxury; his trout and cream, 
for instance, was both praised and vilified 
(for causing indigestion). “Natural man’ 
Rousseau, by contrast, wrote, “If lam given 
milk, eggs, salad, cheese, brown bread, and 
ordinary wine I am sufficiently entertained” 
In this, Rousseau was backed by medical 
reformer Philippe Hecquet, who in around 
1700 advocated a simple, nearly meat-free 
diet as morally and hygienically superior. 

The Enlightenment's obsession with classes 
of food and their effects is echoed in Consider 
the Fork. Wilson's take on technologies of 
cooking and eating mixes material sciences, 
engineering, anthropology and history. 
Organized under eight broad topics — 
including fire, ice, cutting and grinding — the 
book spans millennia, and covers tools from 
Neolithic mortars and pestles to modernist 
cuisine’s liquid-nitrogen canisters. The focus 
is Western, but Wilson is well informed about 
equipment from elsewhere, especially Asia. 

She emphasises how technologies deter- 
mine what we eat. Electric refrigerators, for 
instance, elbowed out curing and smoking 
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as necessary household techniques, and put 
fresh and cooked-from-frozen food on the 
daily table. Meanwhile, some odder inven- 
tions, such as the mechanical apple parer, 
have fallen off the counter edge for good in 
most kitchens. 

The chapter on knives is excellent. In 
3,000 Bc, bronze knives were shaped like 
modern ones, but dulled so quickly that flints 
were often used in preference. Iron knives 
came into use around 700 Bc, and within a 
few centuries carbon-steel knives were being 
made both in China and the West. 

In modern times the French have 
used many specialized knives, from the 
mushroom-fluting bird-billed knife to 

the flexible fish knife that slips fillets of 
sole off the bone. Such elaborate differen- 
tiation is part of French history; at Euro- 
pean tables of the fifteenth century, carvers 
conducted knife-work theatrics, wielding a 
variety of blades to dismember and serve all 
sorts of flesh, fish and fowl. However, in the 
Chinese kitchen a single rectangular cleaver, 
the tou, is used for the most robust butch- 
ering tasks, as well as the finest dicing and 
shredding. Originating towards the end of 
the Chou dynasty (1045-256 Bc), when steel 
technology came to China, skills with this 
single blade became ever more refined and 
the language of cutting and shredding ever 
more specialized. Beautifully cut ingredients 
remain at the heart of Chinese gastronomy. 

In the past 60 years, two waves of game- 
changing equipment have entered the 
kitchen. The food processor and microwave 
arrived in the mid-twentieth century. More 
recently, the ‘lab’ apparatus and ingredients 
as yet unfamiliar to the home cook — such as 
the gelling agents iota carrageenan and kon- 
jac gum — have begun to appear as essentials 
for whipping up the confections of molecu- 
lar gastronomy. Once again, hard-won skills 
become redundant, and the increasing use 
of instruments has diminished our sensory 
involvement with the act of cooking. 

Even skills such as whisking have gone the 
way of the medieval boar spit in many kitch- 
ens. Today, swathes of people in the Northern 
Hemisphere need scarcely touch their dinner 
until they sit down to eat it. Wilson is clearly 
ambivalent about these technological tides. 

Meanwhile, home videos of meals cooked 
from the monumental Modernist Cuisine 
(Cooking Lab, 2011) by Nathan Myhrvold, 
Chris Young and Maxime Bilet can be found 
on YouTube. A new cycle of debate begins. m 


Barbara Ketcham Wheaton is honorary 
curator of the Culinary Collection at the 
Schlesinger Library, Radcliffe Institute 

for Advanced Study, Harvard University, 
Cambridge, Massachusetts, and the author 
of Savouring the Past: the French Kitchen 
and Table from 1300 to 1789. 

e-mail: bkwheaton@verizon.net 
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STATISTICS 


Forecasting with finesse 


Paul Ormerod assesses a Bayesian take on predicting 
everything from poker games to climate change. 


ate Silver is an intriguing man. 
| \ | A statistician by training, he devel- 
oped a system for forecasting base- 
ball performance that has had real influence 
on how the top teams evaluate potential 
players. His blog made accurate predictions 
about the 2008 US Presidential election. He 
has a regular slot in the New York Times. 
And now, in his thoughtful, engaging The 
Signal and the Noise, he offers an array of 
fascinating examples of forecasting, from 
baseball and elections to poker, chess, stock 
markets, terrorist attacks, earthquakes and 
climate change. 

The ‘signal’ in the title refers to genuine 
information, which can be used for predic- 
tion. ‘Noise’ is the purely random component 
of data, which cannot. A serious problem 
with many forecasting models is that they try 
to explain too much, and end up ‘explaining’ 
the noise. Silver documents this little-known 
but fundamental problem of ‘overfitting’ in 
clear terms. 

Silver has a serious scientific purpose. 
He is highly critical of the dominant, fre- 
quentist approach in statistics, which relies 
on the frequency of one possibility coming 
up over a number of trials to determine the 
true probability. For example, if heads come 
up more frequently than tails when a coin 
is tossed, your confidence 
that the coin is biased will 
increase the more tosses 
you observe. Silver is a fer- 
vent advocate of the rival 
approach: Bayesian statis- 
tics. This is just as math- 
ematically rigorous, 
but is a more heuristic 
approach, in which you 
form a view about the 
chances of an event hap- 
pening and revise this esti- 
mate as new information 
comes to light. So, your 
prior estimate that a coin is 
biased will depend on whether 
the person flipping is a seedy man 
in a bar or the Archbishop of Canter- 
bury. If the former, you may conclude it is 
biased after, say, only 
three heads come up 
in succession. 

Most of Silver’s 
examples come from 
the social sciences, 
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which encompass the really difficult prob- 
lems, such as forecasting the economy. In 
the natural sciences, a theory that explains 
the past will also, in general, be usable for 
making predictions about the future. Not 
so the social sciences, in which a clear dis- 
tinction usually needs to be made between 
explanation and prediction. For example, 
there are models that describe well why price 
changes in stock markets in the past exhibit 
the features they do; these are based on how 
traders behave. Forecasting is much harder; 
and ‘predictive’ models are essentially pure 
extrapolations, over very short time periods, 

of past data, with no behavioural content. 
In the natural sciences, theories can be 
developed and verified — or, to be more 
accurate, not falsified — by the evidence 
obtained from replicable experiments. In 
the social sciences, such strong sup- 
port for a theory is rarely, if ever, 
possible. For example, there is cur- 
rently a major debate about whether 
increases in public spending will 
boost the economy. Despite a huge 
amount of theoretical and empirical 
work, economists are no nearer to a 
consensus than they were 

a) 


50 years ago. 

The availability of ‘big 
data — from mobile- 
phone usage and social- 
network use, for instance 

— is currently seen by 
many as raising the 
potential for social sci- 
ences to approach the 
predictive power of 
the natural sciences. 

Detailed behav- 

ioural observa- 
\ tion, the reasoning 

goes, would be a 
step closer to the 
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realm of replicable 
experiments. Silver is 
profoundly sceptical 
about this possibility, 
and argues that a mas- 
sive increase in data 
will make predictions 
more prone to failure, 
not less. As he puts it, 
“the number of mean- 
ingful relationships ... 
is [tiny]... there isn’t 
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The Signal and 
the Noise: Why So 
Many Predictions 
Fail — but Some 
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key critique of Bayes- 


ian analysis, which is that it introduces the 
‘unscientific’ concept of personal judgement. 
But in the social sciences, this is required 
all the time, even when the frequentist 
approach is used. 

A revealing case emerged in 2007, when 
the Bank of England produced its ‘fan charts, 
showing the potential range of outcomes for 
economic growth over the next five years. 
This indicated that the probability of a UK 
recession occurring in 2008-12 was essen- 
tially zero. But the economists had used 
data from only 1993-2006 to calibrate the 
probability distribution of growth rates; they 
had convinced themselves that their 1990s 
theories had solved problems of the econ- 
omy once and for all. Using the frequentist 
approach, they had judged data from before 
1993 as irrelevant — which we now know to 
be profoundly wrong. 

Silver discusses the poor record of eco- 
nomic forecasting at length, and is correct 
in stressing the importance of understand- 
ing the data, rather than just pouring it into 
a statistical package and pressing the but- 
ton. He does not, however, make clear how 
a Bayesian approach could have predicted 
the financial crisis. 

Nor does he discuss the extent to which a 
given data set contains any signal at all, rather 
than being completely dominated by noise. 
In data sets in the social sciences, there are 
often many factors that can influence out- 
comes, and behaviour varies over time; so 
the data produced by a system is often indis- 
tinguishable from a random series. In such 
cases, neither the frequentist nor the Bayesian 
approach will produce successful forecasts. 

But, overall, Silver does make a good case 
for a Bayesian approach. His book is a lively, 
well-argued corrective to the prevailing view 
that we need large amounts of data before we 
can make intelligent and accurate forecasts. m 


Paul Ormerod is an economist and 
complex-systems theorist, and author of 
Positive Linking: How Networks Can 
Revolutionise the World. 

e-mail: pormerod@volterra.co.uk 
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Redirect research to 
control coffee pest 


The coffee-berry borer 
(Hypothenemus hampei) was 
accidentally introduced into 
Brazil in 1913 and later invaded 
coffee plantations throughout 
South and Central America, 
Mexico and the Caribbean. 

The insect still causes worldwide 
annual losses of some 

US$500 million, affecting the 
incomes of more than 20 million 
coffee-farming families in 
roughly 80 nations. We contend 
that a radical change in research 
direction is called for if the 
damage inflicted by this pest is 
to be contained. 

The topic has generated more 
than 1,600 papers, so far with 
little practical success. This is 
partly because the insect spends 
most of its life concealed inside 
the coffee berry feeding on the 
seeds, making its management 
difficult. But it may also be 
because the focus on developing 
biological control agents such 
as fungal entomopathogens and 
parasitoids is too narrow. The 
results have been disappointing, 
and major practical and economic 
obstacles have prevented their 
implementation in the field. 

Research should instead be 
aimed at exploring potential 
attractants and repellents, and at 
gaining a better understanding 
of the microbiota associated with 
the insect. 

Francisco Infante, Jeanneth 
Pérez El Colegio de la Frontera 
Sur, Tapachula, Chiapas, Mexico. 
Fernando E. Vega USDA, ARS, 
Beltsville, Maryland, USA. 
fernando.vega@ars.usda.gov 


Follow the money on 
climate controversy 


Dan Kahan’s analysis of the 
‘controversy’ over climate change 
is insightful up to a point (Nature 
488, 255; 2012). But there is one 
crucial factor that he does not 
mention: money. 

The climate-change 
controversy has been created 


by a tiny group of people whose 
livelihoods depend on it. Oil and 
its derivatives have made these 
people extremely rich, so they 
perpetrate the idea that climate 
change induced by the use of 
fossil fuels is a myth to ensure 
that they stay that way. 

The public is prey toa 
systematic campaign to pollute 
the science-communication 
environment (as Kahan so aptly 
puts it), backed by vast wealth. 
This campaign is being indirectly 
abetted by the US Supreme 
Court, which has declared that 
political groups may spend 
limitless amounts of money 
without identifying themselves. 
Thomas E. DeCoursey Rush 
University Medical Center, 
Chicago, Illinois, USA. 
tdecours@rush.edu 


Green issues are 
catching on in China 


Your report on the rise of 
environmental protest in China 
(Nature 488, 261-262; 2012) is 
backed up by findings of the Pew 
Global Attitudes Project, which 
indicate that 80% of Chinese 
people now rate environmental 
protection as a priority (see 
go.nature.com/tehbh9). 

Landmark cases include 
spontaneous protests against 
paraxylene pollution in Xiamen 
in 2007 and in Dalian in 2011; 
against garbage incineration 
at Panyu in 2009; and against 
waste-water pollution in Qidong 
earlier this year. So far, people 
seem more concerned about the 
impact of local pollutants than 
about national environmental 
issues (W. Li, J. Liu and D. Li 
J. Environ. Mgmt 98, 65-72; 
2012). 

These protests have raised 
awareness of environmental 
pollution among local 
governments and forced them to 
address some of the issues. 

Environmental protection on 
a wider scale will demand more 
concerted action, with strong 
leadership and organization, 
and regulations that protect the 
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rights of individuals. It will mean 
supporting the winners and 
compensating those who could 
lose out economically. 

Qiang Wang, Xi Chen Xinjiang 
Institute of Ecology and 
Geography, China. 
giangwang7@gmail.com 
Yi-Chong Xu Griffith University, 
Australia. 


Rationality: Evidence 
must prevail 


I strongly disagree with Daniel 
Sarewitz’s suggestion that non- 
mathematicians must use faith 
to “believe” in the Higgs boson 
(Nature 488, 431; 2012). The 
particle’s existence is based on 
hard evidence, not belief. 

Evidence is why most ill 
people visit doctors, rather than 
sacrifice chickens or visit priests. 
If we were to abandon evidence, 
we would soon be in some post- 
modernist hell. As for faith, the 
rational thought that underpins 
science provides us with a 
system that works. It fosters 
questioning and makes risky, 
falsifiable predictions; religion 
does neither, as it demands blind 
acceptance of dogma. 

Had science and rationality 
been abandoned in favour 
of religion, then the stoning 
of adulterers would be much 
more common and there would 
have been many fewer women 
participating in the Olympic 
Games. Let’s be grown up, 
rational, accept evidence and put 
the fairies to bed. 
David J. Hosken University 
of Exeter Cornwall Campus, 
Penryn, UK. 
d.j.hosken@exeter.ac.uk 


Rationality: Science 
is not bad faith 


Perhaps Daniel Sarewitz (Nature 
488, 431; 2012) should read 
more Ludwig Wittgenstein. The 
philosopher opposed the view 
of religion as a flawed version 

of science, lacking in evidence, 
sophistication or predictive 


power. Rather, he considered 
religion a matter of shared 
practices, observance and ritual 
that has a special significance 

to its practitioners. To that 
extent, there is no meaningful 
opposition between religion and 
science: they do not even occupy 
the same logical space. 

Sarewitz’s account of what 
religion can teach science 
indicates a belief that the two 
activities are somehow similar 
and express the same aim. 
Viewing temples and falling in 
love can be moving experiences, 
but they don't reveal a hidden 
reality whose articulation eludes 
science. 

Iam nota believer in the Higgs 
boson (or related particles) 
merely because of a cultural 
artefact. I was born in a country 
that trusts theoretical physicists 
more than, say, astrologists 
(who need no physics for their 
predictions). That situation has 
arisen because of the divergent 
successes of the physical sciences 
and astrology in understanding 
the world. 

Religion need not be bad 
science; science is not bad faith. 
Andy Greenfield Medical 
Research Council Harwell, UK. 
a.greenfield@har.mrc.ac.uk 


Rationality: Religion 
defies understanding 


Daniel Sarewitz suggests that we 
need ways of “understanding our 
world beyond the scientifically 
rational” (Nature 488, 431; 

2012). Our species has derived 
many things from its various 
religions — some fair and noble, 
others foul and destructive — but 
understanding is not one of them. 
Andrew Blight Acorda 
Therapeutics, Ardsley, New York, 
USA. 

arborlight@gmail.com 
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Author guidelines can be 
found at go.nature.com/ 
cmchno. 
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EVOLUTION 


How the unicorn got its horn 


An experiment studying bacterial populations over thousands of generations shows that a novel trait can evolve through 
rearrangement and amplification of a few pre-existing genes. SEE ARTICLE P.513 


HEATHER HENDRICKSON & PAUL B. RAINEY 


volutionary novelties such as flight, 
Hives and eyes are central to life’s 

diversity, and yet explanations for their 
origins are anything but straightforward. The 
difficulty arises because the relevant events 
happened a long time ago under circumstances 
that are poorly understood. However, in this 
issue, Blount et al.' (page 513) unravel molecu- 
lar details underpinning a major innovation 
in a trait (phenotype) that evolved in bacte- 
rial populations after 30,000 generations. The 
capacity to store ancestral types in a state of 
suspended animation and then sequence their 
genomes has provided a remarkably complete 
picture of the main genetic events that gave rise 
to an instance of evolutionary novelty*. 

The ‘modern evolutionary synthesis’, 
which provides the accepted account of evo- 
lution, unites Charles Darwin's gradualism 
— the view that evolution proceeds through 
gradual changes — with Gregor Mendel’s 
concept of genes as discrete units. But some 
have argued that the resulting notion of incre- 
mental mutational change does not provide 
an adequate framework for explaining how 
phenotypic innovations arise*”. What steps 
precede a major evolutionary advance such 
as the eye? Far from being a simple philo- 
sophical point, the need for mechanistic 
understanding at this level is paramount, and 
Blount et al. (researchers in Richard Lenski’s 
laboratory) provide direct insight into how 
innovation happens. 

In 1988, Lenski initiated what is now the 
longest-running project in experimental evo- 
lution. He took a clone (a group of genetically 
identical cells derived from a single cell) of 
the fast-growing bacterium Escherichia coli 
and founded 12 independent populations, 
each resident in a flask containing a sim- 
ple medium, with glucose as the sole source 
of carbon. Every day for almost 25 years, a 
sample of each population has been trans- 
ferred to a fresh flask — which means that 
these populations have gone through 
55,000 generations. In addition to daily trans- 
fer, the researchers have been periodically 
storing samples at —80°C, thereby ensuring 


*This article and the paper under discussion! were 
published online on 19 September 2012. 
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Figure 1 | The evolution ofa new trait. In 1988, Richard Lenski tooka single clone formed by genetically 
identical Escherichia coli bacteria and propagated it, with daily transfer, in 12 independent flasks. Whereas 
the original cells could use citrate as a nutrient only in the absence of oxygen (Cit cells), bacteria in one 

of the flasks eventually evolved the ability to use it with oxygen present (Cit’ cells, indicated in red). After 
analysing the genomes of the microbes at different stages of the process, Blount et al.’ (researchers in Lenski’s 
laboratory) found that the original clone had diverged into competing lineages — some of which went 
extinct and some of which prospered. The authors report that, for the new ability to appear, the bacterial 
populations went through three successive evolutionary steps. ‘Potentiating’ mutations (of unclear nature) 
were required for cells to acquire ‘actualizing’ mutations that consisted of a specific rearrangement of 

a few genes and that allowed some growth — although poor — on citrate in the presence of oxygen. 
Further ‘refining’ mutations, which involved duplications of the rearranged DNA sequence, were needed 


for robust growth under such conditions. 


access to a detailed history of ancestral states. 
Moreover, unlike fossils, these frozen ancestors 
are readily brought back to life. 

Although Lenski’s group reported adap- 
tations that improved the microbes’ fitness 
from the earliest stages of the experiment, 
something remarkable happened after about 
31,000 generations. One of the populations 
(Ara-3) evolved the capacity to use citrate 
as a nutrient’. Citrate, a chelating agent that 
facilitates iron uptake, is a component of the 
growth medium. Although E. coli can ‘digest’ 
citrate when oxygen is absent, it typically 
cannot do so under the aerobic conditions of 
this experiment. The fact that the microbe can 
use citrate under some circumstances raises an 
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important point: new genes are not invented 
de novo. Quite the contrary — evolution 
builds on the raw material at its disposal. The 
bacterium has the building blocks necessary 
for the evolution of aerobic citrate utiliza- 
tion, but realization of this capacity requires 
a rewiring and refinement of regulatory 
connectivities. 

To understand the nature of the genetic 
changes conferring the ability to use citrate 
in the presence of oxygen (a trait dubbed the 
Cit* phenotype), Blount et al. returned to the 
freezer and resuscitated ancestral Cit’ popu- 
lations from earlier time points in the Ara-3 
lineage. When the researchers tried to ‘replay’ 
the evolution of the trait, they found that 


the Cit” phenotype appeared again only in 
populations derived from the most recent 
Cit” lineages. This result suggested that gen- 
etic changes had occurred in these lineages 
that made it possible for the Cit” phenotype 
to evolve. Although the precise nature of 
such ‘potentiating’ mutations is unclear, the 

important fact is that they took place (Fig. 1). 
The step that made the cells weakly Cit” — 
the ‘actualizatio’ step — was easier to identify. 
The authors examined a genomic region that 
carries the gene citT, which encodes a pro- 
tein (citrate transporter) needed for citrate 
to enter the cell. In the original bacterial 
strain, citT is located downstream of citG 
(another gene required for citrate utilization) 
and rnk, a functionally unrelated gene that 
participates in energy metabolism. In each 
of the derived Cit* cells, Blount et al. found a 
distinctive genomic rearrangement that fused 
rnk to citG. As a result, expression of citG and 
citT came under the control of the promoter 
(regulatory sequence) of rnk, which allowed 
their expression in the presence of oxy- 
gen. What’s more, the authors found that a 
single copy of the genetic rearrangement was 
insufficient to generate the Cit* phenotype, and 
that there was a tan- 


“New genes are dem array of between 
not invented two and nine copies 
de novo. Quite in Cit* cells. There- 
the contrary: fore, gene duplica- 
evolution builds tions took a very low 

level of citrate trans- 
a port and propelled 


it to life-sustaining 
levels. All cells that 
inherited the genetic 
fusion (the actualizing mutation), followed by 
amplification (a ‘refining’ mutation), were Cit*. 

The discovery of the Cit” mutants in 
Lenski’s experiment has been a mote in the eye 
for those suggesting that major phenotypic 
innovations cannot be explained by micro- 
evolutionary (gradual) processes. Indeed, for 
anti-evolutionists, lack of mechanistic detail 
has even allowed room for divine interven- 
tion. After several attempts to politely address 
the concerns of one critic, Lenski responded 
publicly in what has proven to be one of the 
most competent and direct defences of science 
versus dogma for some time. It includes the 
memorable phrase’, “In other words, it’s not 
that we claim to have glimpsed ‘a unicorn in 
the garden’ — we have a whole population of 
them living in my lab!” 

With knowledge of the key events that deter- 
mine this unicorn’s horn, one can next look 
for lessons of a general nature. The idea that 
genetic evolution might conform to a set of 
rules is attractive’. Capture and amplification 
of other genes’ promoters have been shown to 
facilitate adaptation in a variety of situations”*, 
but the extent to which these kinds of events 
can fuel phenotypic innovation depends on 
prior history, chance and fortuitous factors 


its disposal.” 


such as genome architecture. Blount and col- 
leagues’ work shows that evolution, given 
sufficient time, will take advantage of any 
opportunity — but opportunity may depend 
on the most fickle of potentiating events. 
Unfortunately, predicting the nature of these 
crucial potentiating mutations may well lie 
beyond reach. Nevertheless, on a positive note, 
this work strengthens a central premise of the 
modern evolutionary synthesis: genetic change 
is gradual. Phenotypic change, however, can 
be dramatic. m 


Heather Hendrickson and Paul B. Rainey 
are at the New Zealand Institute for Advanced 
Study, Massey University, Auckland 0745, 


QUANTUM COMPUTATION 
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New Zealand. P.B.R. is also at the Max Planck 
Institute for Evolutionary Biology, Plén, 
Germany. 

e-mail: p.b.rainey@massey.ac.nz 
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Spinning towards 
scalable circuits 


Silicon devices form the backbone of modern computers. It turns out that they 
might also be a natural hardware platform for a new era of computing technology 
that uses the principles of quantum physics. SEE LETTER P.541 


LEE C. BASSETT & DAVID D. AWSCHALOM 


ore than 50 years ago, the invention 
M of integrated circuits based on silicon 

transistors (tiny electronic switches) 
revolutionized electrical and computer 
engineering. It allowed room-sized comput- 
ers made of vacuum tubes to be replaced 
with individual semiconductor chips, which 
have continued to become exponentially 
smaller and more powerful. Now, a new kind 
of computing that relies on quantum rather 
than classical physics has the potential to 
crack problems inaccessible to even the most 
powerful classical computers of the foreseeable 
future. However, we are still searching for the 
best physical architecture for such quantum 
machines — even a suitable ‘quantum vacuum 
tube} let alone the transistor equivalent, 
remains elusive. 

Among many candidates for the quantum 
bits (qubits) of information that form the 
basis of quantum computing, the spins of 
single electrons or isolated nuclei in host semi- 
conducting materials offer great potential for 
incorporation into large-scale devices. Silicon 
is a particularly attractive host material, in no 
small part because of its use in the well-devel- 
oped microchip industry. On page 541 of this 
issue, Pla et al.' report the first fully functional 
electron-spin qubit based on a single phospho- 
rus atom placed in a silicon device*. 


*This article and the paper under discussion’ were 
published online on 19 September 2012. 


In addition to the two allowed states, 0 or 1, 
of classical bits (for example, the on/off states 
of transistors in a processor, or the magnetiza- 
tion direction of magnetic domains on a hard 
disk), a qubit, being quantum mechanical, 
can also exist in an arbitrary superposi- 
tion of 0 and 1. Quantum mechanics also 
allows for non-trivial correlations between 
the states of multiple qubits, whereby meas- 
urement of one qubit alters the states of the 
others. A reasonably sized quantum machine 
could leverage the vastly increased number 
of states offered by superpositions, along 
with non-classical qubit correlations, to 
solve certain types of problem much more 
efficiently than any classical computer. 

Many important problems fall into this 
category, including Peter Shor’s famous algo- 
rithm? to finda number's prime factors, and the 
simulation of complex quantum systems. The 
large-scale implementation of Shor’s algorithm 
would fundamentally alter data encryption, 
and quantum simulators have the potential to 
revolutionize physics, chemistry and materi- 
als science, and therefore to drive advances in 
engineering, biology and medicine. 

The prototypical qubit is the spin of an 
elementary particle such as an electron that 
can point either ‘dowr’ (0) or ‘up’ (1). Even 
ignoring quantum effects, these two spin states 
can be used as the logical states for classical 
information processing — this is the essence 
of ‘spintronics. But quantum computation 
also requires full control of superposition 
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Figure 1 | A spin qubit device. Pla and colleagues have constructed’ a qubit device that could be 
incorporated into a future silicon-based integrated quantum circuit. The device’s components are all 
based on standard silicon microfabrication technology. The qubit is the spin of an electron bound to a 
single phosphorus atom implanted below the surface of a silicon wafer, and is controlled by a set of surface 
electrodes. A transistor that is sensitive to the charge of a single electron (a ‘single-electron transistor’) 

is used to initialize and read out the spin, and a separate microwave control line generates an oscillating 
magnetic field to coherently control the spin. A set of voltage control electrodes provides electrostatic 


tuning to turn the readout on or off. 


states, including preservation of the relative 
quantum phase of the up and down spin 
states. Such ‘phase coherence’ is extremely 
fragile, being easily washed away (decohered) 
by noise from external sources. Qubit imple- 
mentations therefore seek to balance two 
inherently contradictory requirements: 
the ability to engineer strong interactions 
between qubits and external fields to initialize, 
manipulate and subsequently read out quan- 
tum states; and the ability to switch off those 
interactions — using ‘knobs’ such as voltages 
and currents — so that coherence is preserved 
during a calculation. Atoms and ions isolated 
in a vacuum, for example, offer long coher- 
ence times but are difficult to scale into large 
devices, whereas implementations based on 
semiconductor technology, which seem easier 
to scale up, typically suffer from higher rates 
of decoherence. 

The qubit constructed by Pla et al. effectively 
combines the advantages of both atomic and 
solid-state implementations by using the spin 
of an electron bound to a single phosphorus 
atom that is implanted in a silicon substrate 
near a microfabricated transistor (Fig. 1). 
The architecture, inspired by Bruce Kane’s 
proposal’ more than a decade ago, exploits 
the remarkable properties of ‘shallow donor 
atoms, such as phosphorus, at low tempera- 
tures. Phosphorus is commonly added to 
silicon to enhance its electrical conductivity 
for microelectronics, because each phospho- 
rus atom provides an extra electron that is 
mobile at room temperature. Below liquid- 
helium temperatures (4.2 kelvin), however, the 


electron becomes bound to the atom’s nucleus, 
providing a ‘trapped spir’ that can be used as 
a qubit. 

Certain intrinsic properties of silicon 
protect these donor-bound electrons from 
decoherence, particularly a low density of 
nuclear spins (95% of natural silicon is the 
isotope silicon-28, which has no nuclear spin) 
and weak coupling between the orbital motion 
of its electrons and their spin. In high-purity 
silicon samples enriched with the **Si isotope, 
electron-spin coherence times of ensembles 
of phosphorus atoms can exceed 1 second’, 
the longest known of any solid-state electron 
spin. Most importantly, with modern nano- 
fabrication technology, it is possible to posi- 
tion a single phosphorus atom near control 
electrodes to form a quantum device. 

Initialization and readout of spin states in 
a system similar to that of Pla and colleagues 
has already been demonstrated’, using a 
clever technique called spin-to-charge con- 
version. In this process, the phosphorus 
atom’s electron spin is converted into a dif- 
ference in the atom’s charge state (either ion- 
ized or neutral) by arranging that the electron 
will leave the atom by tunnelling to a nearby 
nanoscale silicon transistor only if it is in the 
‘up’ state. The same transistor subsequently 
measures the charge state (and hence also 
the spin) and resets the qubit in the ‘down 
state, which serves as the starting point for 
the next operation. In no small technical 
feat, Pla et al. have now added a microwave 
control electrode to this single-donor/tran- 
sistor architecture operating at millikelvin 
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temperatures. The electrode generates mag- 
netic fields oscillating at a frequency of 
30 gigahertz to coherently manipulate the 
qubit between initialization and readout 
operations (Fig. 1). 

It is encouraging that the coherence time 
measured in this device, about 200 micro- 
seconds, is close to the value of about 300 ts 
measured for ensembles of phosphorus 
donors in silicon samples that have the natu- 
ral isotopic composition’. This suggests that 
added noise from the electrodes and from 
the qubit’s proximity to impurities on the 
substrate’s surface is not too detrimental. 
With this device, a typical ‘quantum opera- 
tion driven by the microwave field lasts about 
100 nanoseconds, which means that more 
than 1,000 operations are possible within the 
coherence time. With further developments in 
microwave engineering to increase the strength 
of the oscillating magnetic field, sub-nano- 
second operations are attainable’. Moreover, 
if isotopic purification boosts the coherence 
time close to the exceptional values of about 
1 second measured’ for ensembles of isolated 
phosphorus atoms in **Si, an unprecedented 
ratio of about 10° operations per coherence 
time could be feasible. 

By engaging the nuclear spin of phospho- 
rus, even more powerful applications will be 
possible. Nuclear coherence times can exceed. 
those of electrons by several orders of magni- 
tude, and a fully functional electron-spin qubit 
should allow quantum control of the cou- 
pled nuclear spin, as has been demonstrated 
for a similar electron—nuclear spin system 
in diamond’. The long-lived nuclear spin could 
then act as a built-in ‘quantum memory node’ 
for each electron-spin qubit. 

The crucial next step for silicon-based 
quantum information technology will be 
to develop devices that have multiple qubits 
coupled together, as is required for full-scale 
quantum computation. The recent demonstra- 
tion of atomically precise placement of donor 
atoms’ is encouraging, because it would allow 
two donors to be placed close enough to each 
other (about 10 nanometres) for exchange 
interactions to mediate the coherent transfer 
of quantum information between them. Such 
precision engineering might also allow the 
assembly of more elaborate structures, such as 
chains of donor atoms that could be used as 
a ‘spin bus” to convey quantum information 
around a large-scale device, perhaps between 
separated computational devices such as the 
one demonstrated by Pla and co-workers. 
If all of these pieces come together, it could 
be that future ‘integrated quantum circuits’ 
do not look so very different — in terms of 
materials, processing and control electronics 
— from the classical ones we have been using 
for the past 50 years. m 
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Luck or reason 


The concept of preferential attachment is behind the hubs and power laws seen in 
many networks. New results fuel an old debate about its origin, and beg the question 
of whether it is based on randomness or optimization. SEE LETTER P.537 


ALBERT-LASZLO BARABASI 


ften a field’s most profuse concept is 

also its most mysterious. Think wave- 

functions in quantum mechanics, 
dark energy in astrophysics and non-coding 
DNA in genomics. Network science has 
its own: preferential attachment, which 
states that the more connected a network 
node is, the more links it will acquire in 
the future. The impact of preferential 
attachment is hard to miss — the prin- 
ciple is responsible for the omnipresent 
network hubs, from Facebook and Google 
on the World Wide Web to protein p53, the 
‘cancer hub; in human cells. However, its ori- 
gins remain a source of constant wonder and 
speculation. The latest attempt to shed light 
on its roots is presented by Papadopoulos 
et al.' on page 537 of this issue. 

Preferential attachment made its first appear- 
ance in 1923 in the celebrated urn model of the 
Hungarian mathematician Gyorgy Pélya’, 
and it has reappeared repeatedly over the past 
century, particularly in the social sciences. 
Although Robert Merton named it the Matthew 
effect’ in 1968 after the Gospel of Matthew, 
“For everyone who has will be given more, and 
he will have an abundance’, its current usage 
emerged only in 1999, with the discovery that 
it accounts for the power-law distributions 
observed in several real networks’. 

A new node joining a network, such as a 
new web page or a new protein, can in prin- 
ciple connect to any pre-existing node. How- 
ever, preferential attachment dictates that its 
choice will not be entirely random, but linearly 
biased by the degree of the pre-existing 
nodes — that is, the number of links that the 
nodes have with other nodes. This induces 
a rich-get-richer effect, allowing the more- 
connected nodes to gain more links at the 
expense of their less-connected counterparts. 
Hence, the large-degree nodes turn into hubs 


*This article and the paper under discussion! were 
published online on 12 September 2012. 


New node 


Figure 1 | Randomness or optimization? Two 
families of models could explain the origin of 
preferential attachment in networks, according to 
which the probability II(k) that a new node links to 
a pre-existing node that has degree k (the number 
of links that the node has with other nodes) is 
proportional to k. One family of models, to which 
the model introduced by Papadopoulos et al.’ 
belongs, assumes that preferential attachment is 
rooted in an optimization framework (right side). In 
these models, a new node will connect to the node 
that is most similar to it (most similar colour) but 
also has the largest degree. The central node offers 
the best balance between these two options. The 
other family of models relies on randomness (left 
side). In this case, the new node is colour-blind, so it 
randomly selects a link and connects 

to its target. Once again, the central node, which has 
the most links pointing to it, has the highest chance 
of being selected. 


and the network becomes scale-free — the 
probability distribution of the degrees over 
the entire network follows a power law. This is 
a frail set-up, as any nonlinearity in preferen- 
tial attachment will either eliminate the hubs 
or generate super-hubs, leading to the loss of 
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the scale-free property’. However, in every 
system in which it has been possible to measure 
preferential attachment, a linear form has 
been detected”’. 

The centuries-old proverb ‘birds of a feather 
flock together’ captures the idea that humans 
tend to hang out with those who are similar 
to them. Sociologists call this homophily, 
and it is perhaps one of the best documented 
concepts in the social sciences. Papadopou- 
los et al.’ propose that homophily might also 
contribute to preferential attachment. They 
introduce a model in which each node is 
assigned a randomly chosen position along a 
circle that serves as a ‘homophily space’: the 
closer two nodes are to each other on the cir- 
cle (that is, the smaller the angle 6 spanned 
by the nodes when measured from the circle’s 
centre), the more similar they are (see Fig. 1 
of the paper’). The network expands through 

the addition of new nodes, such that a 

node added at time t = 1, 2,... will choose 

to connect to a pre-existing node added at 
time s only if node s offers the smallest of all 
possible products s6,,, where 6,, is the angu- 
lar distance between nodes s and t. Hence 
the new node optimizes its choice between 
two often conflicting interests: the node it 
will link to should be the most connected (the 
oldest, with the smallest s) and the most 
similar to it (the smallest @,,). 

Interestingly, by placing each node at 
distance r, = Int from the centre of the 
homophily circle, the authors find that the 
network evolves not on the circle but in a 
hyperbolic space, a geometrical space that 
is familiar mainly to those well versed in 
cosmology and general relativity. In this space, 
strange things can happen, such as parallel 
lines meeting each other and triangles that 
have zero-degree angles. Yet the model has 
its simplest interpretation in this peculiar 
space, where new nodes simply connect to the 
nodes closest to them. The authors show that 
the resulting network is scale-free and that a 
linear preferential attachment is the model’s 
emerging feature. 

The new model fuels a slowly evolving 
debate — is preferential attachment rooted 
in pure chance or in some form of optimiza- 
tion? Indeed, the most accepted mechanisms 
of preferential attachment rely on dumb luck. 
The simplest one is this: first randomly select 
a link in a directed network, for example the 
links of the Word Wide Web that point toa 
document; then connect the new node to the 
selected link’s target®. The more connected 
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nodes have an advantage here, as the chance 
that a new node connects to them is propor- 
tional to their degree. Variants of this simple 
mechanism lie behind the popular copying 
model proposed to explain the scale-free 
nature of the web’ and the emergence of hubs 
in protein-interaction networks through gene 
duplication'*"’. According to these mecha- 
nisms, preferential attachment does not 
require human agency, but is rather a conse- 
quence of purely random actions. By contrast, 
Papadopoulos and colleagues’ model calls for 
clear agency, as each new node seeks to link to 
the closest and oldest node. In this respect, the 
model supports earlier mechanisms, devel- 
oped in the context of the Internet’*”’, pro- 
posing that preferential attachment is rooted 
ina wish to balance distance to the target node 
with some utility, such as access to bandwidth. 

Both approaches are tempting. Random 
models ask little of us, and demonstrate how 
random actions can result in outcomes that are 
not so random. Yet we do not think that the 
choices we make are ever random, fuelling the 
attractiveness of models that invoke some form 
of optimization. 

This tension between two equally attrac- 
tive but apparently opposing alternatives is 


REGENERATIVE BIOLOGY 


by no means new. In the 1960s, the economist 
Herbert Simon and the mathematician Benoit 
Mandelbrot fought a fierce public dispute, 
with Simon defending the role of randomness 
and preferential attachment in explaining the 
power-law distribution of word frequencies in 
text, and Mandelbrot arguing for an optimiza- 
tion framework”. In the past decade, experi- 
mental evidence for preferential attachment 
in the context of networks has tilted the argu- 
ment in Simon's favour. And now the debate is 
shifting to a deeper question — whether pref- 
erential attachment is the outcome of random 
actions or optimization (Fig. 1). 

This debate helps us to understand how 
preferential attachment emerges in an identi- 
cal form in such widely different systems. The 
fact that the effect is widespread suggests that 
it probably derives from both agency and ran- 
dom actions. Most complex systems have a bit 
of both, so we do not need to choose between 
them. Luck or reason, preferential attach- 
ment wins either way. And so do we, gaining 
a deeper understanding of this puzzling yet 
ubiquitous force. = 
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Skin, heal thyself 


Simply grabbing an African spiny mouse can cause it to lose up to 60% of the skin 
on its back. Analysis of the mouse’s astounding shedding and healing capacity 
provides insight into the biomechanics of tissue regeneration. SEE LETTER P.561 


ELLY M. TANAKA 


ost people know that lizards can 
Mi predators by jettisoning their 

tail, a process called autotomy. The 
tail musculature and bone organization of 
these reptiles allow breakage at defined planes 
when the need arises’ and, after healing, the tail 
is restored as an imperfect but functional rep- 
lica of the original’. By contrast, only owners 
of exotic pets may have witnessed the esca- 
pades of the African spiny mouse (genus 
Acomys), which shimmies away by losing its 
tail skin. On page 561 of this issue, Seifert et al. ? 
reveal that the skin shedding and regeneration 
observed in the tail extend to other parts of the 
mouse’ body, paving the way for detailed stud- 
ies of the molecular and structural basis of this 
fascinating — and potentially therapeutically 
useful — ability. 

Previous analysis* showed that the tail skin 
of spiny mice is more loosely attached to the 
underlying muscle and bone than the tail 
skin of other rodents such as the common 
house mouse (Mus musculus), a feature that 


probably facilitates its shedding in Acomys. To 
assess further the skin properties of spiny mice, 
Seifert and colleagues examined wild-caught 
specimens of two species, Acomys kempi and 
Acomys percivali. Even as the researchers col- 
lected them, they noticed that the animals lost 
up to 60% of the skin on their back when they 
were grabbed by hand in a normal manner (see 
Fig. 1c of the paper’). 

When the researchers put Acomys skin 
under defined loads, they found it to be very 
brittle: it had a 20-fold lower tensile strength 
than skin from M. musculus, and 77-fold less 
energy was required to elicit skin tearing. The 
authors observed no predefined breakage- 
point structures in Acomys skin, suggesting 
that an overall fragility accounts for the ease 
in shedding. It is not yet clear what molecular 
or biomechanical properties underlie this fra- 
gility, although Seifert et al. point out that the 
hair follicles are larger and take up a greater 
proportion of the skin’s surface in Acomys than 
in Mus species, which might have some effect. 

Perhaps even more impressive than the 
shedding ability of Acomys is their healing 
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response. Skin consists of an outer epidermal 
layer and an inner dermal layer. Seifert et al. 
report that the animals regrew skin with hair 
within 30 days of damage, and that the epider- 
mis re-formed more rapidly in Acomys species 
than in M. musculus. Furthermore, they 
found that less underlying wound-bed tissue 
(which is associated with scarring) formed in 
Acomys, and that this tissue was composed pre- 
dominantly of the extracellular matrix (ECM) 
molecule collagen IIL, rather than of aligned 
fibrils of collagen I, which is characteristic 
of scar formation in mammals. Finally, the 
authors observed that the healing epidermal 
cells of Acomys formed new hair follicles, 
and that the signalling pathways involved in 
follicle formation resembled those used during 
embryonic hair development. 

In skin healing in other mammals, de novo 
hair-follicle formation is considered difficult 
to achieve — although not unheard of. Hair- 
follicle regeneration has been seen in large 
wounds made in rabbits and in M. musculus**. 
In these conditions, epidermal cells ‘crawl’ over 
the open wound to form a new epidermal layer 
that contains many hair follicles. Interest- 
ingly, these new hairs remain unpigmented, 
whereas it can be seen in Seifert and colleagues’ 
paper that the new Acomys hairs are the 
usual colour. 

Taken together, these results indicate that 
the epidermal cells that cover wounded tissue 
during healing can establish interactions with 
the underlying cells that promote hair-follicle 
formation. Cross-talk between the epidermis 
and underlying mesenchymal cells is known 
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Ear punch hole 


Figure 1 | Wound repair in African spiny mice. Seifert et al.’ studied tissue regeneration in Acomys 
mice after a hole 4 millimetres in diameter was punched in the ear. They show that the holes were filled 
in by regeneration of cartilage, adipose tissue and skin, which included the dermal and epidermal layers 
and contained hair follicles. The process involved rapid re-formation of the skin’s outer layer by crawling 
epidermal cells. Underneath this layer, proliferation of mesenchymal cells led to the formation of a cell 
mass (which also contained the proteins fibronectin and tenascin) that looks similar to the blastema 
formed during limb regeneration in salamanders, which is known to promote regeneration of structures 
such as bone and nervous tissue’. During this process, the basement-membrane layer of the extracellular 
matrix that usually separates the dermis from the epidermis was absent. 


to be crucial for embryonic hair-follicle 
development (for a review, see ref. 7). It seems 
that this cross-talk can still occur in adult 
mammalian skin during wound healing, but 
that in many mammalian species the initia- 
tion of epidermal crawling is slow to occur. 
By contrast, Acomys wound healing seems to 
be characterized by rapid re-epithelialization 
(during development and wound healing, the 
cells that will form the epidermis are called 
the epithelium) and hair-follicle regeneration. 
It is worth noting that extremely rapid re- 
epithelialization is also a hallmark of limb and 
tail regeneration in amphibians such as frogs 
and salamanders’. 

Finally, Seifert and colleagues further tested 
the regenerative capacity of Acomys tissue by 
assessing the response of the animals’ ear tis- 
sue to having a hole punched through it. They 
found that the holes were filled in by cartilage, 
adipose, dermal and epidermal structures, 
although not by muscle. In fact, the regenera- 
tion process was, in many ways, similar to that 
of complete limb regeneration in salamanders. 
In these amphibians, the stump of an ampu- 
tated limb is rapidly covered by a layer of non- 
proliferating epidermal cells. Proliferation of 
mesenchymal progenitor cells in the under- 
lying tissues then produces a cell mass called 
the blastema’. Communication between the 
epidermal layer and the blastema promotes the 
regeneration of bones, peripheral nerves and 


muscles’’. During this process, the basement- 
membrane layer of the ECM that normally 
separates the epidermis from the dermis is 
absent, and the blastema cells are surrounded 
by the ECM molecules fibronectin, hyaluronic 
acid and tenascin’. 

Seifert et al. report that Acomys ear-tissue 
regeneration closely resembles this process: 
there was rapid formation of a non-prolifer- 
ative epidermal layer accompanied by mes- 
enchymal cell proliferation; no basement 
membrane was established between the wound 
epidermis and the mesenchymal layer; and the 
mesenchymal tissue was rich in fibronectin, 
tenascin and collagen III (Fig. 1). By contrast, 
when the authors assessed the ear-punch 
response in M. musculus (ear holes do not 
regenerate in these mice), the authors observed 
only initial, but not persistent, mesenchymal 
cell proliferation. Furthermore, the M. muscu- 
Ius mesenchymal cells expressed the smooth- 
muscle protein actin, which is characteristic 
of cells laying down fibrils of collagen I that 
contribute to scar tissue. The authors observed 
only weak actin expression in Acomys. 

Although ear-punch regeneration has been 
described previously in rabbits, some strains 
of mice and other mammals'*"*, Seifert and 
colleagues’ analysis of this response in spiny 
mice confirms that rapid re-epithelialization 
and a distinct profile of ECM deposition 
correlate with successful tissue regeneration 
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in mammals. These studies suggest that the 
pathways leading to regeneration, at least of 
the skin, that are normally associated with 
amphibians are also accessible in mam- 
mals. Unlocking this cascade in a controlled 
manner in other wound situations may help 
to promote scarless healing. However, one 
aspect that the authors did not address in 
these studies is whether alteration of the 
immune response, which has been proposed 
to contribute to scar-free healing and success- 
ful regeneration, is another feature of Acomys’s 
impressive regenerative abilities°. How 
an animal can lose 60% of its back skin and 
yet survive both microbial attack and tissue 
desiccation will be another intriguing aspect 
for future studies. m 


ELECTRONICS 


Elly M. Tanaka is at the DFG Center 
for Regenerative Therapies, Technische 
Universitdt Dresden, 01309 Dresden, 
Germany. 

e-mail: elly.tanaka@crt-dresden.de 


1. Pratt, C. W. J. Anat. 80, 184-188 (1946). 

2. Simpson, S. B. JrAm. Zool. 10, 157-165 
(1970). 

3. Seifert, A. W. et al. Nature 489, 561-565 (2012). 

4. Shargal, E., Rath-Wolfson, L., Kronfeld, N. & 
Dayan, T. J. Zool. 249, 187-193 (1999). 

5. Breedis, C. Cancer Res. 14, 575-579 (1954). 

6. Ito, M. et al. Nature 447, 316-320 (2007). 

7. Schneider, M. R., Schmidt-Ullrich, R. & Paus, R. 
Curr. Biol. 19, R132-R142 (2009). 

8. Repesh, L. A. & Oberpriller, J.C. Am. J. Anat. 159, 
187-208 (1980). 

9. Hay, E. D. & Fischman, D. A. Dev. Biol. 3, 26-59 
(1961). 


‘Cut and stick’ ion gels 


Ion gels are composites of ionic liquids and polymers. Free-standing forms of ion 
gels have now been made that can be neatly cut with a razor blade and stuck onto 
semiconductor materials to make transistors. 


MASASHI KAWASAKI & YOSHIHIRO IWASA 


obile phones cannot function without 
Me battery and the billions of tiny 
electronic switches (transistors) 
that they contain. The transistors control the 
flow of electrons in semiconductors to allow 


the processing and storage of information, 
whereas the battery stores electrochemical 
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energy to power the transistors. The first 
transistor was invented in 1947 by William 
Shockley, John Bardeen and Walter Brattain. 
Interestingly, the initial experiments that led 
to the realization of the device used a liquid 
electrolyte — a mixture of a liquid solvent 
and ions — to control the electron trans- 
port in a semiconductor. In other words, the 
first true transistor was not an all-solid-state 
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Figure 1 | An ion-gel transistor. Lee and colleagues” have made a free-standing ion gel that can be placed 


on a semiconductor to make a transistor. The ion gel serves as the device's gate insulator, separating the 
gate electrode from the semiconductor that forms the channel between the source and drain electrodes 
and which is laid ona substrate. When the device is switched on, an electrical double layer (EDL) of 
charges forms at the interface between the ion gel and the semiconductor, giving the device a large 
electrical capacitance. 
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device but an electrochemical one’. 

Because of the enormous success of solid- 
state electronics, this early history of transis- 
tors and the usefulness of liquid electrolytes 
in electronic devices have been somewhat 
forgotten. In the past decade, however, tran- 
sistors based on electrolytes and electrochemi- 
cal concepts have been receiving renewed 
attention, because they offer functionalities 
beyond those of conventional, all-solid-state 
transistors. Writing in Advanced Materials, 
Lee et al.” report a rubber-like material that is 
made from an ionic liquid (a salt that exists 
in the liquid state at room temperature) anda 
polymer and which can be ‘cut and stuck’ onto 
semiconductors to make a transistor. 

Electrolytes are ionically conducting but 
electronically insulating substances. When an 
electrolyte is placed on an electronically con- 
ducting solid, an electrical double layer (EDL) 
is formed at the interface, an effect that was 
first modelled by Helmholtz’ in 1853. Owing 
to the nanoscale gap between its two charged 
layers, the EDL has a huge capacitance (ability 
to store charge) — of the order of 1-10 micro- 
farads per square centimetre (uF cm”). This 
value is 10-1,000 times greater than that of 
conventional electrical capacitors, which are 
based on solid insulators. 

By using an EDL in place ofa solid insulator, 
which is typically made of silicon dioxide or 
aluminium oxide, researchers have made tran- 
sistors that show surprising and potentially 
useful effects, including electric-field-induced 
phase transitions such as insulator-to-metal 
transitions’, superconductivity and ferro- 
magnetism’. In most cases, ionic liquids were 
used because they have high ionic conductivity 
and yield a large EDL capacitance. However, 
for electronic devices, materials that have 
mechanical and structural integrity are often 
required. 

But in the twenty-first century, we no longer 
have to rely solely on solid-state materials to 
attain such integrity. Ion gels — a mixture of 
ionic liquids and macromolecular compo- 
nents — offer an alternative. They combine 


the advantages of ionic liquids and solid 
insulators, including manipulation at room 
temperature’. Lee et al.* now demonstrate 
free-standing, rubber-like ion gels that can be 
cut with a razor blade and stuck onto several 
semiconductor materials to produce a transis- 
tor. Their approach therefore offers a neat way 
of making transistors. 

The macromolecular components of ion gels 
are block copolymers — polymers in which two 
or more chemically distinct monomer units 
are linked together. In their experiments, Lee 
et al. made free-standing 0.6-millimetre-thick 
films of ion gels by using a weight ratio of block 
copolymer to ionic liquid of 1:4. The capaci- 
tance of the EDL that formed at the interface 
between the semiconductor and the ion gel was 
as high as 10 uF cm ~. Similar free-standing ion- 
gel films have been used as solid electrolytes in 
lithium-ion batteries. But Lee et al. used such 
films as gate insulators in transistors; gate insu- 
lators separate the transistor’s semiconducting 
channel from its gate electrode (Fig. 1). What's 
more, the authors demonstrated transistors 
made from both typical organic and inorganic 
semiconductors; these showed current based 
on ‘holes’ (notional particles formed by the 
absence of electrons) and electrons, respec- 
tively. This method can thus be applied to any 
type of semiconducting material. 


CANCER METABOLISM 


Because ion gels can be made in many forms 
(there are large numbers of ionic liquids and 
polymers), they could be used to obtain diverse 
novel device structures and functionalities’. For 
instance, printing of ion gels on substrates has 
already been demonstrated’. Finally, combina- 
tions of ion gels and a variety of electronic con- 
ductors — not only semiconductors but also 
metals, superconductors and magnets — will 
make ‘iontronics”, electronics based on ionic 
functions, a rich field of materials science. m 
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When more is less 


A tightly regulated enzyme balances energy production and the synthesis 
of macromolecules from glucose in cancer cells. Upsetting this balance by 
stimulating the enzyme’s activity can suppress tumour growth in mice. 


LEI JIANG & RALPH J. DEBERARDINIS 


he development of cancer is associated 
| with a suite of metabolic changes that 
support the energetic and biosynthetic 
requirements of tumour-cell proliferation. 
Many of these changes are stimulated by the 
same genetic mutations that drive tumori- 
genesis, suggesting that normalizing the 
tumour cells’ metabolism by pharmacological 
means could suppress cancer progression. An 
enhancement of glycolysis, a cellular pathway 
that produces both energy and precursors of 
macromolecules, is a classic metabolic hall- 
mark of cancer, making the glycolytic pathway 
an attractive target in which to test this hypoth- 
esis. Writing in Nature Chemical Biology, 
Anastasiou et al.’ show that targeting a form of 
the enzyme pyruvate kinase that is commonly 
found in tumour cells can redistribute the fate 
of glucose-derived metabolites away from 
biosynthetic pathways, and suppress tumour 
growth in mice. 


The ability of the glycolytic pathway to 
produce ATP — the main cellular energy- 
transfer molecule — is largely dependent on 
the activity of pyruvate kinases. There are vari- 
ous mammalian forms of pyruvate kinase, and 
tumour cells typically express the M2 isoform 
(PKM2), which shows reduced enzyme activ- 
ity compared with other pyruvate kinases. This 
was initially thought to be somewhat paradoxi- 
cal in cancer cells, in which abundant expres- 
sion of glucose transporters and glycolytic 
enzymes conspires to enhance overall glyco- 
lysis. But one advantage of a bottleneck at the 
end of an otherwise active pathway is to force 
some of the intermediates it produces to accu- 
mulate, analogous to a partially closed valve at 
the bottom of a column of fluid (Fig. 1). It is 
now thought that the aggregation of metabo- 
lites behind this valve may promote their 
transfer into accessory pathways, including 
those that generate macromolecular building 
blocks and other substrates needed for cellular 
replication’. 
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50 Years Ago 


The third annual scientific meeting 
of the British Academy of Forensic 
Sciences ... was called to consider 
“The Hazards of the Road’, a 
subject to which, having regard 

to the increasing public danger 
deriving from motor traffic, the 
sciences appear to have devoted 
disproportionately little research 
... [P]sychological tests on those 
involved in accidents ... tended 

to show personality traits such 

as aggressiveness, carelessness, 
impulsiveness and the like which 
are not present to the same degree 
in those persons who have been 
free from accidents ... Older 
people tend to make up for their 
longer reaction times by driving 
more slowly ... Although as a man 
grows older, into middle age or 
further, his reaction time is slowed, 
his eyesight is not quite what it used 
to be and his hearing is impaired, 
he nevertheless becomes a 

better driver. 

From Nature 29 September 1962 


100 Years Ago 


When nature was fashioning man 
the forces of natural selection 
made one hand more apt to 
perform skilled movements 

than the other. Why precisely 

it was the right hand that was 
chosen in the majority of mankind 
we do not know ... The fact that 

a certain proportion of mankind 
is left-handed, and that such a 
tendency is transmitted to only 
some of the descendants of a left- 
handed person, might perhaps 
suggest that one half of mankind 
was originally left-handed and the 
other right-handed, and that the 
former condition was recessive 

in the Mendelian sense ... The 
superiority of one hand is as old 
as mankind, and is one of the 
factors incidental to the evolution 
of man. 

From Nature 26 September 1912 
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Figure 1 | A metabolic bottleneck. During glucose metabolism (glycolysis), the enzyme pyruvate kinase 
acts asa ‘tap’ that regulates the rate of flow out of the pathway. a, Cancer cells often express the M2 form 
of the enzyme (PKM2), which has lower enzymatic activity than other isoforms of pyruvate kinase. This, 
combined with the robust glycolysis that is also a hallmark of cancer cells, leads to an accumulation of 
glycolytic intermediates, including precursors of nucleic acids, lipids, serine, glycine and methyl groups. 
The accumulation in turn stimulates the biosynthetic pathways into which these intermediates feed, 
thereby contributing to the rapid cell proliferation seen in tumours. b, By contrast, enhancing pyruvate 
kinase activity by replacing PKM2 with the more active PKM1 isoform causes a decline in these pools 

of biosynthetic precursors and reduced cell proliferation’. Anastasiou et al.' show that a similar tumour- 
suppressive effect can be achieved using small molecules that enhance PKM2 activity. 


This explanation for the role of PKM2 in 
cancer gained momentum when it was dem- 
onstrated that replacing PKM2 in cancer cells 
with the more active PKM1 isoform resulted 
in higher overall activity of pyruvate kinase 
but reduced macromolecular synthesis and 
tumour growth in mice’. Further support 
came from studies showing that many of the 
processes that regulate PKM2 in cancer cells, 
such as binding of the enzyme by tyrosine- 
phosphorylated proteins, act to reduce rather 
than stimulate its enzymatic activity*”. 

A possible implication of this picture of 
cancer-cell glycolysis is that stimulating PKM2 
activity — in effect, opening the floodgates — 
might impair tumour growth by reducing the 
availability of macromolecular precursors. 
Anastasiou et al. tested this idea by using small 
molecules that clamp the enzyme into a highly 
active tetramer configuration (a complex con- 
taining four subunits of the enzyme), and then 
analysing tumour-cell glycolysis. They found 
that cellular pools of glucose-dependent bio- 
synthetic intermediates became depleted, 
and that flux through the pathways supplied 
by them was suppressed. Treatment with 
the PKM2 activator molecules also reduced 
the proliferation of cancer cells subjected to 
low oxygen levels, a condition that favours 
glycolysis. Finally, when the authors treated 


tumour-bearing mice by oral administration of 
PKM2 activators, they found suppressed levels 
of biosynthetic intermediates in the tumours, 
and substantially slower tumour growth. 
Importantly, the small molecules also ren- 
dered PKM2 insensitive to blockade by tyros- 
ine-phosphorylated proteins, suggesting that 
the agents might provide durable metabolic 
effects even in tumours that are programmed 
to reduce the activity of pyruvate kinases. 
These studies of PKM2 activation provide 
a foundation for broader investigations of 
metabolism and cancer biology. Anastasiou 
and colleagues’ results indicate that PKM2 
affects biosynthesis in more subtle ways than 
those shown in Figure 1. For example, they 
found that activating PKM2 reduces the flux 
from glucose metabolism to fatty-acid syn- 
thesis, a pathway typically thought to involve 
the formation of pyruvate (a glycolytic inter- 
mediate molecule) by pyruvate kinases. 
Together, these new data imply that submaxi- 
mal PKM2 activity helps to channel metabo- 
lites into biosynthetic pathways through 
mechanisms that do not simply involve an 
accumulation of intermediates, but that are 
not yet fully understood. It will be interesting 
to test whether modulating glycolysis through 
PKM2 activation or other mechanisms can 
be extended to suppress the proliferation 
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of cells under conditions of normal oxygen 
levels, because such suppression would also be 
desirable in fast-growing areas of tumours that 
are well perfused by oxygen. 

The push to understand PKM2 function, 
and now to manipulate it to suppress tumour 
growth, emphasizes a shift in focus in cancer 
metabolism research. Since Otto Warburg's 
early experiments on tumour glycolysis in the 
1920s, work in this field has largely centred 
on how tumours produce energy. But recent 
studies have taken a much broader view of the 
metabolic network in cancer cells, leading to a 
greater appreciation of the complexities of bio- 
synthesis, reduction-oxidation homeostasis 
and other facets of metabolism that support 
cell survival and proliferation. These inves- 
tigations have uncovered many unexpected 
roles for metabolism in cell signalling and the 
regulation of gene expression, extending the 
reach of metabolic enzymes into essentially 
every area of cell biology®. These newly iden- 
tified functions are highly relevant to cancer. 
Consider, for example, that PKM2 also acts as 
an activator of gene expression, such that it 
has proliferation-promoting activities that are 
independent of its enzymatic function’. 

Finally, Anastasiou et al. have illustrated an 
important concept in the metabolic control 
of tumorigenesis. They show that activating 
PKM2 suppresses cell proliferation — but 
severely inhibiting this enzyme is known to 
have similar effects, and can induce tumour 
regression in mice®”. It is likely that the high- 
est rates of cell proliferation are the result of a 
metabolic network in which a set of enzymes 
that, like PKM2, contribute to biosynthesis are 
precisely controlled, but whose maximal acti- 
vation is counterproductive. These nodes may 
be the most easily disrupted control points 
in the metabolic network, and identifying 
them should offer the best opportunities for 
metabolic therapy in cancer. m 
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Genomic analysis of a Key innovation in an 
experimental Escherichia coli population 


Zachary D. Blount'”, Jeffrey E. Barrick***, Carla J. Davidson? & Richard E. Lenski!? 


Evolutionary novelties have been important in the history of life, but their origins are usually difficult to examine in 
detail. We previously described the evolution of a novel trait, aerobic citrate utilization (Cit*), in an experimental 
population of Escherichia coli. Here we analyse genome sequences to investigate the history and genetic basis of this 
trait. At least three distinct clades coexisted for more than 10,000 generations before its emergence. The Cit* trait 
originated in one clade by a tandem duplication that captured an aerobically expressed promoter for the expression of 
a previously silent citrate transporter. The clades varied in their propensity to evolve this novel trait, although genotypes 
able to do so existed in all three clades, implying that multiple potentiating mutations arose during the population’s 
history. Our findings illustrate the importance of promoter capture and altered gene regulation in mediating the 
exaptation events that often underlie evolutionary innovations. 


Evolutionary novelties are qualitatively new traits that open up eco- 
logical opportunities and thereby promote diversification’. These 
traits are thought to arise typically by the exaptation of genes that 
previously encoded other functions”® through processes such as 
domain shuffling’, altered regulation® and duplication followed by 
neo-functionalization”’®. Multiple mutations may be necessary to pro- 
duce the new function®”’, and thus its potential to evolve may be 
contingent on subtle differences between species, populations or even 
genotypes. A complete understanding of the evolution of a novel trait 
requires explanation of its ecological function, its physiological basis, 
the underlying mutations and the history of the accumulated changes’. 

Evolution experiments with microorganisms offer unparalleled 
opportunities to assess the evolution of novel traits’? '°. Microbes 
have rapid generations and large populations, and new technologies 
allow discovery of mutations throughout their genomes'*’’. Samples 
can be frozen and revived, allowing phylogenies and mutational 
histories to be constructed and analysed’*””. 

Twelve populations of Escherichia coli have been propagated in the 
long-term evolution experiment (LTEE) for over 40,000 generations 
in a glucose-limited minimal medium’*. The medium also contains 
abundant citrate, which is present as a chelating agent”, but E. coli 
cannot exploit citrate as a carbon and energy source in the well- 
aerated conditions of the experiment*'’’. The inability to grow 
aerobically on citrate is a long-recognized trait that, in part, defines 
E. colias a species”’. Spontaneous citrate-using (Cit*) mutants are extra- 
ordinarily rare*‘, but a Cit* variant evolved in one population (desig- 
nated Ara—3) around 31,000 generations”. Cit™ cells became dominant 
after 33,000 generations, although Cit” cells persisted. This shift was 
accompanied by a several-fold increase in total population size owing 
to the high concentration of citrate relative to glucose in the medium. 

The emergence of the Cit” trait was contingent upon one or more 
earlier mutations in the population’s history. When evolution was 
replayed from clones isolated at various time points, clones from later 
generations were more likely to produce Cit" mutants than did the 
ancestor and other early clones”’. This finding implied that a genetic 
background evolved that ‘potentiated’ the evolution of this trait. In 


principle, this effect could involve two distinct mechanisms. One 
possibility is that the rate of mutation, or certain types of mutation, 
increased such that the required event was more likely to occur in later 
generations. The other possibility is an epistatic interaction, such that 
the expression of a mutation that produced the Cit* trait required one 
or more preceding mutations. Here we report the results of extensive 
whole-genome re-sequencing that allowed us to reconstruct the 
population’s history, identify mutations underlying the Cit" pheno- 
type, and elucidate the physiological basis of this novel trait. 


Genome sequences and phylogeny 


We sequenced 29 clones sampled at various generations from popu- 
lation Ara-3, including nine Cit’ and three Cit” clones known to be 
potentiated” (Supplementary Table 1). Mutations including single- 
nucleotide polymorphisms (SNPs) as well as deletions, insertions and 
some chromosomal rearrangements were identified (Supplementary 
Table 2) by comparing reads to the genome of the ancestral strain, 
REL606 (ref. 25). Inversions and other rearrangements that involved 
long sequence repeats would have escaped detection. 

We reconstructed the population’s phylogenetic history from 
presence-absence matrices of all mutations identified in the sequenced 
genomes. Figure 1 shows that the population was polymorphic for 
most of its history. Several clades, each represented by multiple clones, 
arose before 20,000 generations. One of them, which we call UC 
(unsuccessful clade), was not seen after 15,000 generations. Clades 
Cl, C2 and C3 coexisted through the evolution of Cit* and beyond. 
Cl includes four clones, the earliest from generation 25,000, although a 
molecular clock (Fig. 1 inset) implies that Cl diverged from the 
ancestor of C2 and C3 before 15,000 generations. C2 and C3 had 
diverged by generation 20,000. C3 includes both Cit’ clones and all 
the Cit* clones, although the first Cit* cells did not arise until ~31,000 
generations. Two mechanisms may explain the prolonged coexistence 
of the Cit” lineages. First, they may all have acquired beneficial muta- 
tions without one gaining enough advantage to fix’. Alternatively, 
they might have filled subtly different niches, such as through the 
differential use of secreted metabolites”. 
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Figure 1 | Phylogeny of Ara-3 population. Symbols at branch tips mark 29 
sequenced clones; labels are shown for clones mentioned in main text and 
figures. Shaded areas and coloured symbols identify major clades. Fractions 
above the tree show the number of clones belonging to the clade that yielded 
Cit* mutants during replay experiments (numerator) and the corresponding 
total used in those experiments (denominator). Inset shows number of 
mutations relative to the ancestor. The solid line is the least-squares linear 
regression of mutations in non-mutator genomes; the dashed line is the 
corresponding regression for mutator genomes. 


The sequenced Cit* clones from generation 36,000 and later have 
an SNP in the mutS gene that produces a premature stop codon and 
truncates the MutS protein, thereby causing a defect in methyl- 
directed mismatch DNA repair”. We sequenced mutS from three 
Cit* clones from each of generations 33,000, 34,000, 35,000, 36,000, 
37,000 and 38,000, and found this SNP in 0, 0, 2, 3 and 3 clones, 
respectively, indicating that the mutation arose after the origin of 
the Cit” lineage. Also, none of the 20 Cit” genomes has this mutation. 
As a consequence, Cit” genomes from later generations accumulated 
SNPs much faster than Cit” and early Cit" genomes (Fig. 1 inset). 
Mutator phenotypes evolved in several other populations in the 
LTEE'*®, so this change was not unique to the Cit” lineage. 


Evolution of the Cit* function 


The evolution of the Cit* trait involved three successive processes: 
potentiation, actualization and refinement. The ancestor’s rate of 
mutation to Cit’ was immeasurably low, with an upper bound of 
3.6 X 10’? per cell-generation”’. Potentiation refers to the evolution 
of a genetic background in which this function became accessible by 
mutation. An extremely weak Cit* variant had emerged by 31,500 
generations, which represents the actualization step. The new func- 
tion was then refined, which allowed the efficient exploitation of 
citrate, the rise of the Cit” subpopulation to numerical dominance, 
and the expansion of the total population size. Of these processes, 
actualization is the most tractable for study owing to the discrete 
phenotypic change, and we therefore focused on identifying and 
characterizing the mutational basis of actualization. With that 
information in hand, additional analyses then shed new light on the 
processes of refinement and potentiation. 
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Actualization of the Cit* function 


One reason that E. coli cannot grow aerobically on citrate is its 
inability to transport citrate***'*, The origin of the Cit phenotype 
therefore required expression of a citrate transporter. All nine 
sequenced Cit’ genomes have two or more tandem copies of a 
2,933-base-pair segment that includes part of the citrate fermentation 
(cit) operon (Fig. 2a). The amplified segment contains two genes: rna, 
which encodes RNase I’, and citT, which encodes a broad-spectrum 
C,-di- and tri-carboxylic acid transporter that functions in fermenta- 
tion as a citrate/succinate antiporter. The boundary upstream of citT 
is in the 3’ end of the citG gene, which encodes triphosphoribosyl- 
dephospho-CoA synthase, whereas the boundary downstream of 
rna is in the 5’ end of rnk, which encodes a regulator of nucleoside 
diphosphate kinase**. This amplification is not present in the ancestor 
or any of the sequenced Cit’ genomes, and it is only found in popu- 
lation samples after the evolution of the Cit” lineage (Supplementary 
Table 3). Polymerase chain reaction (PCR) screens also failed to detect 
this segment in 27 Cit” clones from generations 33,000 through 40,000, 
whereas it was found in all 33 Cit* clones from the same generations 
(Supplementary Table 4). 

Amplification mutations can alter the spatial relationship between 
structural genes and regulatory elements, potentially causing altered 
regulation and novel traits****. The structure of the cit amplification 
led us to propose that the Cit* trait arose from an amplification- 
mediated promoter capture (Fig. 2b and Supplementary Fig. 1). The 
amplification joined upstream rnk and downstream citG fragments, 
producing an rnk-citG hybrid gene expressed from the upstream rnk 
promoter. Because the citT and citG genes are normally monocistronic, 
the downstream copy of citT should therefore be co-transcribed with 
the hybrid gene. If the rnk promoter directs transcription under oxic 
conditions, then the new rnk-citT regulatory module might allow CitT 
expression during aerobic metabolism and thereby confer a Cit* 
phenotype”. 

To test this hypothesis, we first examined the capacity of the rnk- 
citT module to support citT expression in oxic conditions. We con- 
structed a low-copy (one to two per cell) plasmid, pCDrnk-citTlux, 
with an rnk-citT module in which citT was replaced by the luxCDABE 
reporter operon. We made two other plasmids, pCDrnklux and 
pCDcitTlux, in which the reporter was under the control of the native 
upstream regulatory regions of rnk and citT, respectively. We trans- 
formed each plasmid into REL606, the ancestral strain; ZDB30, a 
potentiated C3 clone from generation 32,000; and ZDB172, a weakly 
Cit* clone from generation 32,000. We measured expression (light 
production) during growth and stationary phase under oxic condi- 
tions (Fig. 3). The native citT regulatory region showed no expression 
(above background) in any strain, indicating that citT is normally 
silent under oxic conditions. The native rnk regulatory region was 
expressed in all three strains, with a peak around the transition into 
stationary phase. Expression from the evolved rnk-citT module was 
much weaker, but there were small spikes in expression in ZDB30 
and ZDB172 coincident with peak expression from the native rnk 
regulatory region. These results indicate that the rnk-citT module 
can support citT expression during aerobic metabolism. 

This hypothesis also predicts that the introduction of the rnk-citT 
module should confer a Cit* phenotype, whereas the loss of the cit 
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Figure 2 | Tandem amplification in Cit* genomes. a, Ancestral arrangement 
of citG, citT, rna and rnk genes. b, Altered spatial and regulatory relationships 
generated by the amplification. 
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Figure 3 | Expression levels from native citT, native rnk and evolved rnk- 
citT regulatory regions during aerobic metabolism. Average time course of 
expression for the ancestral strain REL606 (left) and evolved clones ZDB30 
(centre) and ZDB172 (right), each transformed with lux reporter plasmids 
pCDcitTlux (blue), pCDrnklux (black) and pCDrnk-citTlux (red). Expression 
was measured as counts per second (c.p.s.). Each curve shows the average of 
four replicates. 


amplification should cause reversion to a Cit” state. We tested whether 
the rnk-citT module confers a Cit" phenotype by inserting a single 
copy of an amplification fragment containing the rnk promoter 
immediately upstream of the chromosomal copy of citT (Fig. 4a) in 
potentiated clone ZDB30. This construct, ZDB595, indeed has a cit* 
phenotype, although it is extremely weak, similar to the earliest evolved 
Cit* variants (Fig. 4b). In the same medium used in the LTEE, ZDB595 
experienced a lag of more than 60h after glucose depletion, followed 
by a short period of abortive and inconsistent growth on citrate 
(Fig. 4b, c). These data indicate that the initial Cit type was too weak 
to allow exploitation of citrate under the daily-transfer regime of the 
LTEE. Nonetheless, ZDB595 had a small (1.0%) but significant com- 
petitive advantage over ZDB30 in the same environment (n = 10, 
t = 3.09, two-tailed P = 0.0128). We also isolated 13 Cit” revertants 
of a 33,000-generation Cit’ clone, CZB154. All had lost the cit 
amplification based on both PCR and Southern blot analyses 
(Supplementary Fig. 2), further supporting the hypothesis that the 
amplification event had produced the Cit* phenotype. 


Refinement of the Cit* function 


Given the extremely weak initial Cit” phenotype, additional mutations 
must have refined the new function. Refinement is an open-ended 
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Figure 4 | New rnk-citT module confers Cit" phenotype in potentiated 
background. a, Engineered construct containing the cit amplification junction 
with the rnk promoter, and the arrangement following its insertion into the 
chromosome. b, Average growth trajectories in DM25 of potentiated clone 
ZDB30 (red), its isogenic construct with chromosomal integration of the rnk-cit 
module ZDB595 (blue), 31,500-generation Cit” clone ZDB564 (purple) and 
32,000-generation Cit’ clone ZDB172 (green). c, Trajectories of ZDB595 
compared to its parent. Light blue trajectories show heterogeneity among 
replicate cultures of ZDB595; dark blue and red are averages for ZDB595 and its 
parent, ZDB30, as in panel b (except different scale). 


ARTICLE 


process, and we focus only on early refining mutations that led to 
the rise of Cit* cells to high frequency and the concurrent expansion 
of the population. These mutations are evidenced by the improvement 
on citrate of Cit clones isolated between 31,500 and 33,000 genera- 
tions (Supplementary Fig. 3). We examined the genomes of the five 
sequenced Cit clones from before the population expansion, includ- 
ing one from generations 31,500, 32,000 and 32,500 and two from 
generation 33,000. We focused on the mutations on the line of descent 
for the Cit* subpopulation, where that line is defined by the presence of 
the same mutations in clones from generations 34,000, 36,000 and 
38,000 (Supplementary Tables 5-8). Two SNPs and one insertion 
sequence (IS) element were present in only one of the 33,000- 
generation genomes. The IS insertion and one SNP seem 
unrelated to growth on citrate. The remaining SNP is in the regulatory 
region of dctA, which encodes a transporter of succinate and other 
C,-di-carboxylic acids. This mutation may improve recovery of 
succinate exported in exchange for citrate. However, the other 
33,000-generation clone grew better on citrate (Supplementary Fig. 3). 
Thus, the dctA mutation does not seem to be responsible for the 
population expansion, although it might have been advantageous to 
its carriers. 

These early Cit" genomes also show increases in cit copy number. 
The earliest one had a tandem duplication, whereas later genomes had 
a three-copy tandem array within a larger tandem duplication, a four- 
copy tandem array, a tandem duplication in a larger three-copy 
tandem array and a nine-copy tandem array (Supplementary Table 9). 
Changes in amplification copy number readily occur by recombination, 
and they have been implicated in the refinement of other weak func- 
tions***', These changes increased the number of rnk-citT modules 
relative to the earliest Cit” genome (Fig. 5a) and presumably increased 
the expression of CitT as well. To test whether an increased number 
of rnk-citT modules could have caused the population expansion, 
we cloned the module (Fig. 5b) into the high-copy plasmid pUC19 
(ref. 42) and moved the resulting plasmid, pZBrnk-citT, into the 
potentiated clone ZDB30. The resulting strain, ZDB612, is strongly 
Cit", rapidly transitions from glucose to citrate and grows similarly 
to the 33,000-generation clone CZB152 (Fig. 5c). The increased 
number of rnk-citT modules can thus explain the refinement of the 
Cit* phenotype that allowed the population expansion. 
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Figure 5 | Refinement of Cit* phenotype by increased number of rnk-citT 
modules. a, Change in rnk-citT module copy number in sequenced Cit" clones 
over time. b, Structure of rnk-citT module cloned into high-copy plasmid 
pUC19 to produce pZBrnk-citT. c, Growth trajectories in DM25 of potentiated 
Cit” clone ZDB30 (red), 31,500-generation Cit* clone ZDB564 (purple), 
33,000-generation Cit* clone CZB152 (black), and ZDB612, a pZBrnk-citT 
transformant of ZDB30 (green). Each trajectory is the average of six replicates. 
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In contrast to the early variation in cit amplification, later cit* 
genomes have four-copy tandem arrays (Fig. 5a). Amplifications tend 
to be unstable***', and further refinement may have favoured more 
stable classes of mutations. The evolution of the mutator phenotype in 
the Cit’ lineage complicates efforts to identify these later refining 
mutations, but some interesting candidates include SNPs in citT 
itself: gitA, which encodes citrate synthase; and aceA, which encodes 
isocitrate lyase. 


Potentiation of Cit* evolution 


Before the Cit” trait could evolve, the Ara—3 population had to evolve 
a genetic background in which that new function was accessible by 
mutation. Potentiation was demonstrated by ‘replay’ experiments 
using 270 clones sampled over the population’s history*®. The replays 
produced 17 Cit’ mutants that derived from 13 clones, all from 
generation 20,000 or later. Fluctuation tests confirmed that poten- 
tiated clones had increased mutation rates to Cit", although such 
mutations were still extremely rare”’. 


Phylogeny implies multiple potentiating mutations 

The potentiating mutations are not known to confer any phenotype 
amenable to screening, so there is no simple way to distinguish 
between potentiated and non-potentiated clones. Instead, we 
examined the distribution of the 13 potentiated clones identified by 
the replay experiments using mutations (Supplementary Table 10) 
that differentiated clades UC, C1, C2 and C3 (Fig. 1). We also determined 
the distribution of the other 256 evolved clones used in the replays to 
assess the coverage of the clades in those experiments (Supplementary 
Fig. 4). Overall, 205 clones were assigned to clades, including 12 
potentiated clones (Supplementary Table 11). Sixteen from genera- 
tions 15,000 and earlier were in clade UC. The others came from 
generations 20,000 and later including 55 in C1, 97 in C2, and 37 in 
C3. Potentiated clones occurred in all three with 8 in C3 and 2 each in 
C1 and C2. Nonetheless, this distribution is highly non-random (two- 
tailed Fisher’s exact test comparing C3 with Cl and C2 combined, 
P=0.0003; this test shows no difference between Cl and C2, 
P=0.6206). These data, and the absence of any Cit’ mutants 
generated by the ancestor, indicate that potentiation involved at least 
two mutations, with one arising before these three clades diverged and 
another in C3 (Fig. 1). 


Alternative hypotheses 

Two distinct mechanisms might explain the potentiation effect. One 
is epistasis, whereby an interaction between the potentiating back- 
ground and the actualizing mutation is needed to express the Cit* 
phenotype. The second is that the background physically promoted 
the final mutation; for example, a later rearrangement may require 
some prior genomic rearrangement. If expression of Cit* required 
earlier mutations, then the rnk-citT module should confer a weaker 
Cit’ phenotype in a non-potentiated background than a potentiated 
background. Alternatively, if potentiation facilitated the amplification 
event itself, then that module should produce an equally strong Cit* 
phenotype in both potentiated and non-potentiated backgrounds. 


Evidence for epistasis 

To test these predictions, we first tried to move the single-copy rnk- 
citT module into the ancestral chromosome, but several attempts were 
unsuccessful. This outcome is not surprising under the epistasis hypo- 
thesis; a key step was screening potential recombinants for citrate use, 
and the single-copy module conferred an extremely weak Cit* pheno- 
type even in a potentiated C3 clone (Fig. 4). Instead, we moved 
plasmid pZBrnk-citT into the ancestor and clones from clades C1, 
C2 and C3, and examined their growth trajectories (Fig. 6). All four 
transformants grew on citrate after depleting the glucose. However, 
transformants of the ancestor and Cl and C2 clones grew poorly on 
citrate, even with this high-copy plasmid, as shown by long lags while 
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Figure 6 | Evidence for epistatic interactions in potentiation of Cit* 
phenotype. Growth trajectories in DM25 of diverse clones transformed with 
the pZBrnk-citT plasmid. a, Ancestral strain REL606 and its transformant 
ZDB611. b, Clade C1 clone ZDB199 and its transformant ZDB614. c, C2 clone 
ZDB200 and its transformant ZDB615. d, C3 clone ZDB30 and its 
transformant ZDB612. Red and dark blue trajectories are averages for the 
parent clone and its transformant; light blue trajectories show the replicates for 
the transformant. 


transitioning from glucose to citrate, low yields, and inconsistent 
trajectories across replicates (Fig. 6a—c). By contrast, ZDB612, the 
transformant of a potentiated C3 clone, grew much faster, more 
extensively, and consistently across replicates (Fig. 6d). These differ- 
ences demonstrate epistatic interactions between the rnk-citT module 
and mutations that distinguish the backgrounds. 

These data also support the phylogenetic association between clade 
C3 and the strength of potentiation in the replay experiments. We 
examined C3 for candidate mutations that may contribute to poten- 
tiation. A mutation in arcB, which encodes a histidine kinase*’, is 
noteworthy because disabling that gene upregulates the tricarboxylic 
acid cycle’. That mutation might interact with the rnk-citT module 
by allowing efficient use of citrate that enters the cell via the CitT 
transporter. We tried repeatedly to move the evolved and ancestral 
arcB alleles between strains to test this hypothesis, but without 
success. In any case, the profound differences in growth trajectories 
on citrate enabled by the high-copy plasmid (Fig. 6) support the 
hypothesis that potentiation depends, at least partly, on epistasis 
between the genetic background and the amplification mutation that 
generated that module. 


Evidence against the physical-promotion hypothesis 

We examined the Cit* mutants from the replay experiments” for 
additional evidence on the nature of potentiation. The physical- 
promotion hypothesis predicts that these mutants should have cit 
amplifications similar or identical to the original one. If epistatic 
interactions enhanced citT expression only from the rnk promoter, 
then the prediction would be the same. However, if epistasis operated 
at some broader physiological level, then the replays should have 
diverse mutations that share only the property that they enable 
expression of the citrate transporter in the oxic environment of the 
LTEE. We examined 19 re-evolved Cit’ mutants to identify the 
relevant mutations; the citT region was examined in all of them, 
and the genomes of six were sequenced. All have mutations affecting 
citT, and most clearly put that gene downstream of a new promoter 
(Supplementary Table 12). Four sequenced Cit* mutants were 
derived from Cit clones that were also sequenced. Besides citT- 
related mutations, these sequenced mutants had one to three other 
mutations; no gene was mutated in multiple cases, and none seem 
related to citrate use (Supplementary Table 13), supporting the 
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Figure 7 | Mutations that produced Cit* phenotype in 14 replay 
experiments. The red box shows the boundaries of the 2,933-bp amplified 
segment that actualized the Cit” function in the original long-term population. 
Blue boxes show cifT-containing regions amplified in eight replays that 
produced Cit* mutants. Vertical black lines mark five locations where IS3 
insertions produced Cit’ mutants in six other replays. 


inference that the citT mutations were responsible for all of the re- 
evolved Cit” phenotypes. 

The Cit’ mutants arose by diverse mutational processes 
(Supplementary Table 12). Eight have citT duplications similar to 
the original one, though no two share the same boundaries (Fig. 7). 
In seven of these, the duplications generated alternative versions of 
the rnk-citT module; in the other, the second citT is downstream of the 
rma promoter. Six mutants have an IS3 element inserted in the 3’ end 
of citG (Fig. 7). IS3 carries outward-directed promoter elements that 
can activate adjacent genes*”**. Two mutants have large duplications 
encompassing all or part of the cit operon. One mutant has a large 
inversion that places most of that operon downstream of the 
promoter for the fimbria regulatory gene fimB, and another has a 
deletion in citG that presumably formed a new promoter. Also, most 
of these mutants have stronger phenotypes (Supplementary Fig. 5) 
than the earliest Cit” clones in the main experiment (Fig. 4b and 
Supplementary Fig. 3). In any case, this new function arose in poten- 
tiated backgrounds by a variety of mutational processes that recruited 
several different promoters to allow CitT expression during aerobic 
metabolism. Thus, these data do not support the physical-promotion 
hypothesis, whereas the strain-specific differences in growth on 
citrate conferred by the rnk-citT module provide clear and compelling 
evidence for epistasis (Fig. 6). However, these hypotheses are not 
mutually exclusive, and we cannot reject the possibility that some 
mutation rendered the genome (or the affected region) more prone 
to physical rearrangements (including mobile-element insertions) 
and thereby also contributed to the overall potentiation effect. 


Perspective 


The evolution of citrate use in an experimental E. coli population 
provided an unusual opportunity to study the multi-step origin of a 
key innovation. Comparative studies have shown that gene duplica- 
tions have an important creative role in evolution by generating 
redundancies that allow neo-functionalization®®*”°. Our findings 
highlight the less-appreciated capacity of duplications to produce 
new functions by promoter capture events that change gene regula- 
tory networks**. The evolution of citrate use also highlights that such 
actualizing mutations are only part of the process by which novelties 
arise. Before a new function can arise, it may be essential for a lineage 
to evolve a potentiating genetic background that allows the actualizing 
mutation to occur or the new function to be expressed. Finally, novel 
functions often emerge in rudimentary forms that must be refined to 
exploit the ecological opportunities. This three-step process—in 
which potentiation makes a trait possible, actualization makes the 
trait manifest, and refinement makes it effective—is probably typical 
of many new functions. 


METHODS SUMMARY 


Evolution experiment. Twelve E. coli B populations were propagated for over 
20 years by daily 100-fold transfers into a minimal medium, DM25, containing 
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139 uM glucose and 1,700 1M citrate’*“°. Population samples were frozen every 
500 generations. The focus of this study was population Ara-3, in which the 
ability to grow on citrate under oxic conditions evolved by 31,500 generations”. 
Genome sequencing. Clones were sequenced on Illumina GA, GA II and GA IIx 
instruments. The reads were deposited in the NCBI SRA database (SRA026813). 
Mutations were predicted from read alignments to the ancestral genome 
(REL606)”° using the breseq pipeline” and refined by further analyses for specific 
mutations. 

Phylogenetic analysis. The most parsimonious tree was calculated using the 
irreversible Camin-Sokal model** with presence-absence data in each genome 
for all mutational events. Branch lengths were calculated by a maximum 
likelihood procedure with additional parameters for ancestral and mutator rates 
of Poisson-distributed mutation accumulation. Other clones were put in clades 
based on Sanger sequencing to screen for 10 phylogenetically informative 
mutations. 

Expression experiments. Promoter regions for citT, rnk and the rnk-citT module 
were cloned into the /ux reporter plasmid pCS26-pac”’, and transformed into 
strains REL606, ZDB30 and ZDB172. Light emission was measured as a proxy for 
luciferase expression for 19h in DM25 in 96-well plates with intermittent shak- 
ing. Assays were replicated fourfold for each construct. 

Primers and clones. Primer pairs used in genetic analyses and strain construc- 
tions are shown in Supplementary Table 14. Supplementary Table 15 describes 
the clones used in growth-trajectory experiments. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Evolution experiment. The long-term experiment is described in detail 
elsewhere’*”*. In brief, twelve populations of E. coli B were started in 1988 and 
have evolved since under conditions of daily 100-fold dilutions in a minimal 
medium, DM25, containing 139 [1M glucose and 1,700 uM citrate**. The popula- 
tions undergo ~6.64 generations per day and had been evolving for 40,000 
generations when this study began. Every 500 generations, samples were frozen 
at -80 °C with glycerol as a cryoprotectant. The focus of this study was population 
Ara-3, in which the ability to grow aerobically on citrate evolved by 31,500 
generations”. 

Genomic DNA isolation. Clones were revived from frozen stocks by overnight 
growth in LB medium at 37 °C with aeration. DNA was extracted and purified 
using the Qiagen Genomic-tip 100/G kit (Qiagen). 

Whole-genome sequencing and mutation detection. Clones were sequenced on 
Illumina GA, GA II and GA IIx instruments. The resulting reads were deposited 
in the NCBI SRA database (SRA026813). Most data sets contain single-end reads 
only, but additional mate-paired libraries were obtained for ZDB30 and ZDB172. 
Reads were mapped to the reference genome of the ancestral strain (REL606)”*, 
and mutations were predicted using the breseq computational pipeline’. This 
pipeline detects point mutations, deletions and new sequence junctions that may 
indicate IS-element insertions or other rearrangements, as described in its online 
documentation. Large duplications and amplifications were predicted manually 
by examining the depth of read coverage across each genome. 

Mutation lists (Supplementary Table 2) were further refined by manually 
reconstructing the most plausible series of events generating the observed differ- 
ences. This procedure involved: (1) splitting predicted changes into multiple 
mutations based on phylogenetic relationships, (2) assigning mutations to gen- 
omes when a subsequent mutational event prevented their detection (for 
example, an SNP within a later deletion), and (3) correcting false positives for a 
handful of difficult-to-predict mutations by examining alignments and coverage 
in all clones. After these procedures, 17 homoplasies remained in the phylogeny, 
most of which seem to indicate mutational hot spots: 9 are IS-element insertions 
at specific sites, 7 are insertions or deletions at the boundaries of IS elements, and 
only 1 is a single-base substitution not associated with an IS element. Given these 
signatures, it is likely that these mutations occurred independently in multiple 
lineages. 

The copy number and configuration of the citT module were estimated from 
the number of reads overlapping the new rnk-citT sequence junction relative to 
the two original flanking-sequence junctions, and from the average read-depth in 
this region relative to a single-copy region (Supplementary Table 9). 
Phylogenetic analysis. An initial parsimony-based tree was calculated using 
presence-absence data for all mutational events in each genome using the dnapars 
program from PHYLIP. Branch lengths were recalculated using an irreversible 
Camin-Sokal model"*, because the ancestral states are known and reversions 
unlikely given the genome size and number of mutations observed. A maximum- 
likelihood model was then used to estimate branching times and two mutation 
rates, one for non-mutator branches and one for mutator branches. The model 
fixed the generation when each clone was sampled and assumed that mutations 
accumulated on branches according to a Poisson process. 

Ten mutations in nine genes (ybaL, nadR, hemE, cspC, yaaH, leuA, tolR, arcB 
and gitA) were identified as phylogenetically informative based on their asso- 
ciation with particular clades in the Ara—3 population (Supplementary Table 10). 
Sanger sequencing of PCR-amplified gene fragments was used to determine the 
presence or absence of these mutations in replay clones, which were then mapped 
onto the phylogeny using the keys in Supplementary Fig. 4. Owing to the large 
number of clones and genes under consideration, not all genes were sequenced for 
all clones. Supplementary Table 11 shows the data and phylogenetic assignments. 
Supplementary Table 14 shows the primer pairs used to amplify each locus. 
PCR screens for cit amplifications. The cit amplification was detected in popu- 
lation samples and clones by PCR amplification across the rnk-citG junction using 
outward-directed primers specific to citT (Supplementary Table 14). When screen- 
ing population samples, three reactions were run for each time point, and the 
template was a 1:10 dilution of the frozen sample for that generation. 
Expression experiments. Expression was measured using luciferase-based 
reporter constructs. The complete upstream regions for the native citT and rnk 
genes were PCR-amplified from the cognate reporters from the E. coli transcrip- 
tional library” using the primers pZE05 and pZE07. The intergenic region of the 
evolved rnk-citT module was amplified from Cit’ clone ZDB172 using primers 
nctForward and nctReverse (Supplementary Table 14). The PCR products were 
cloned into the low-copy (1-2) plasmid pCS26-pac, which contains a kanamycin 
resistance gene and the luciferase operon (JuxCDABE)”. Each plasmid was trans- 
formed into clones REL606, ZDB30 and ZDB172. 
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Before the expression assays, strains were grown in a 96-well plate (BD 
Biosciences) in 200 ll per well of DM25 supplemented with 50 pg ml’ kanamycin, 
with constant shaking at 37 °C for two 24-h cycles for acclimation. Fresh overnight 
cultures were then diluted 100-fold into a black, clear-bottomed 96-well plate (9520 
Costar, Corning), with 150 ul per well of DM25, and covered with a breathable 
sealing membrane (Nunc) to prevent evaporation. Light emission was measured 
using a Wallac Victor” plate reader (Perkin Elmer Life Sciences) every 20 min for 
19h with 90s of 2-mm orbital shaking before each reading. Assays were run in 
quadruplicate in the same plate. 

Isogenic strain construction. A single-copy chromosomal rnk-citT module 
was placed in a Cit background using the ‘gene gorging’ method*!. Owing to 
problems inherent to the manipulation of amplified genes, we did not attempt to 
move an entire citT amplification segment. Instead, we engineered a cit amp- 
lification junction containing the rnk promoter from three smaller fragments that 
were PCR-amplified from the 32,000-generation Cit* clone ZDB172. The first 
fragment contained the cit amplification junction (citAmp]), including the rnk 
promoter region, and was PCR-amplified using the primers citTAmp] F and 
citTAmp] R (Supplementary Table 14). The second fragment contained the 
citT-citG junction (citT-citG) and was PCR-amplified using the primers citT- 
citG F and citT-citG R. The third fragment contained sequence internal to the 
citG gene (citGfrag) and was PCR-amplified using citGfrag F and citGfrag R. Each 
primer pair was designed with restriction sites that allowed ligation of the 
fragments to a hybrid construct in which the cit amplification junction, including 
the rnk promoter, was embedded within >500 bp of the citG flanking sequence. 
The assembled module was then PCR-amplified using citT-citG Gorge F and 
citGfrag R (Supplementary Table 14). The forward primer incorporated an 
I-Scel restriction site required for the gene-gorging procedure, which was then 
performed as described elsewhere*’. We screened for putative transformants by 
performing PCR and testing for a Cit" phenotype based on a positive reaction on 
Cristensen’s Citrate Agar*. Successful constructs were confirmed by Sanger 
sequencing. 

Growth trajectories. Strains of interest (Supplementary Table 15) were revived 
from frozen stocks by growing them in LB, and they were then acclimated by two 
24-h culture cycles in DM25. Next, 100 ul of each culture was diluted into 9.9 ml 
of DM25, and 200 ul aliquots were placed into randomly assigned wells in a 
96-well plate. For all pZBrnk-citT transformants, the medium was supplemented 
with 100 ugml~' ampicillin to ensure plasmid retention. Growth trajectories 
were replicated six- to eightfold for each strain. To limit evaporation, strains were 
grown in the innermost 60 wells, whereas the outermost 36 wells were filled 
with 300 pl of saline buffer. When assays ran longer than 96h, the buffer was 
replenished after 96h. OD429 nm Was measured every 10min by a VersaMax 
automated plate reader (Molecular Devices). Plates were shaken orbitally for 
5s before each measurement, but were otherwise stationary. 

Fitness assays. Relative fitness was measured in competition experiments 
described elsewhere”. In brief, we inoculated 0.05 ml each of acclimated cultures 
of ZDB595 and ZDB63, an Ara’ mutant of ZDB30, into 9.9 ml of DM25 with 
tenfold replication. Initial densities were measured by dilution plating on 
tetrazolium arabinose (TA) plates, on which ZDB595 and ZDB63 made red 
and white colonies, respectively. The cultures were then propagated for three 
daily transfer cycles, and the final densities were measured using TA plates. 
Relative fitness was calculated as the ratio of the realized growth rates of the 
competitors over the course of the experiment”. 

Isolation of Cit” revertants. Three 10-ml LB broth cultures were inoculated 
with 15 ul from the frozen stock of 33,000-generation Cit” clone CZB154. 
After overnight growth at 37 °C, the cultures were diluted 10°-fold, and 100 pl 
was transferred to 30 flasks, ten containing 9.9 ml of DM25, ten containing 9.9 ml 
of DM25 without citrate, and ten containing 9.9 ml of M9 broth (a citrate-free 
minimal medium) with 25 1g ml~! glucose. The 30 cultures were incubated at 
37 °C, and propagated through six daily 1:100 serial dilutions. Each culture was 
then diluted 10*-fold, and 100 jl were spread onto each of eight TA plates. The 
plates were incubated for ~24h at 37 °C, and colonies were patched onto TA and 
Minimal Citrate (MC) plates. Clones that grew on TA plates, but not on MC 
plates, were then streaked onto Christensen’s agar. Those clones that did not 
produce a Cit* reaction on Christensen’s agar were retained as Cit revertants. 
Up to 512 colonies per replicate culture were tested for loss of the Cit* phenotype, 
but only one Cit” revertant was retained from any culture. Cit” revertants were 
found in 8/10 citrate-free DM25 cultures and in 5/10 cultures containing M9 
medium. No revertants were isolated from any cultures in normal DM25 
medium, presumably because of the advantage that Cit* cells have when citrate 
is present. The 13 Cit’ revertants were tested for the presence or absence of the cit 
amplification by performing both PCR with the citTout F/R primer pair and citT- 
specific Southern blotting with EcoRV-digested genomic DNA (Supplementary 
Fig. 2 and Supplementary Table 14). 
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Plasmid construction. A DNA fragment containing the complete rnk-citT 
module was PCR-amplified from Cit* clone ZDB172 using primers citTAmpX 
F and citTAmpX R (Supplementary Table 14). The fragment was then inserted 
into the cloning site of pUC19 (NEB), and sequencing confirmed that the 
resulting plasmid, pZBrnk-citT, had the corresponding region. The plasmid 
was transformed into strains REL606, ZDB30, ZDB199 and ZDB200. Cit* 
transformants were identified by positive reactions on Cristensen’s Citrate 
Agar. 

Identification of mutations in Cit* replays. Nineteen Cit’ mutants isolated in 
replay and related experiments” were analysed for mutations. The mutants were 
first checked for large changes in the citT region by Southern hybridization with 
citT-specific probes. Genomic DNA was digested with EcoRV (NEB); fragments 
were separated on 0.8% agarose gels with a 1-kb ladder (NEB), then transferred to 
nylon membranes. Hybridizations were performed at 68°C. The citT-specific 
probe was an internal fragment amplified by PCR using primers in Supplemen- 
tary Table 14, purified using an Illustra GFX PCR DNA purification kit (GE 
Healthcare), and labelled using the DIG DNA labelling and detection kit 
(Roche). Most, but not all, mutants had enlarged citT bands. We tried to PCR- 
amplify across possible amplification boundaries of each mutant using the same 


outward-directed citT primers used to screen for the original amplification in the 
Ara-3 population (Supplementary Table 14). PCR products were thus obtained 
for 8 clones. Sanger sequencing of the products showed novel junctions consistent 
with amplifications similar but not identical to the original case. For 7 other 
clones, PCR products of altered size were obtained when the region immediately 
upstream of citT was amplified, and sequencing showed that each alteration was 
caused by either a small deletion or an IS3 insertion in citG. To identify mutations 
at other loci, as well as mutations affecting the cit region, we sequenced the 
genomes of six mutants, as described above. These genomes included three of 
the four mutants for which the mutations conferring the Cit* trait were not 
resolved using the approaches described above. 
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Comprehensive genomic characterization 
of squamous cell lung cancers 


The Cancer Genome Atlas Research Network* 


Lung squamous cell carcinoma is a common type of lung cancer, causing approximately 400,000 deaths per year 
worldwide. Genomic alterations in squamous cell lung cancers have not been comprehensively characterized, and no 
molecularly targeted agents have been specifically developed for its treatment. As part of The Cancer Genome Atlas, here 
we profile 178 lung squamous cell carcinomas to provide a comprehensive landscape of genomic and epigenomic 
alterations. We show that the tumour type is characterized by complex genomic alterations, with a mean of 360 exonic 
mutations, 165 genomic rearrangements, and 323 segments of copy number alteration per tumour. We find statistically 
recurrent mutations in 11 genes, including mutation of TP53 in nearly all specimens. Previously unreported 
loss-of-function mutations are seen in the HLA-A class I major histocompatibility gene. Significantly altered pathways 
included NFE2L2 and KEAP1 in 34%, squamous differentiation genes in 44°, phosphatidylinositol-3-OH kinase pathway 
genes in 47°, and CDKN2A and RBI in 72% of tumours. We identified a potential therapeutic target in most tumours, 
offering new avenues of investigation for the treatment of squamous cell lung cancers. 


Lung cancer is the leading cause of cancer-related mortality worldwide, 
leading to an estimated 1.4 million deaths in 2010 (ref. 1). The discovery 
of recurrent mutations in the epidermal growth factor receptor (EGFR) 
kinase, as well as fusions involving anaplastic lymphoma kinase (ALK), 
has led to a marked change in the treatment of patients with lung 
adenocarcinoma, the most common type of lung cancer?*. More recent 
data have suggested that targeting mutations in BRAF, AKT1, ERBB2 
and PIK3CA and fusions that involve ROS1 and RET may also be suc- 
cessful®’. Unfortunately, activating mutations in EGFR and ALK fusions 
are typically not present in the second most common type of lung cancer, 
lung squamous cell carcinoma (SQCC)’, and targeted agents developed 
for lung adenocarcinoma are largely ineffective against lung SQCC. 

Although no comprehensive genomic analysis of lung SQCCs has 
been reported, single-platform studies have identified regions of 
somatic copy number alterations in lung SQCCs, including amplifica- 
tion of SOX2, PDGFRA and FGFR1 and/or WHSCI1L1 and deletion of 
CDKN2A*"°. DNA sequencing studies of lung SQCCs have reported 
recurrent mutations in several genes, including TP53, NFE2L2, 
KEAP1, BAI3, FBXW7, GRM8, MUC16, RUNXIT1, STK11 and 
ERBB4 (refs 11, 12). DDR2 mutations and FGFR1 amplification have 
been nominated as therapeutic targets’*”>. 

We have conducted a comprehensive study of lung SQCCs from a 
large cohort of patients as part of The Cancer Genome Atlas (TCGA) 
project. The twin aims are to characterize the genomic and epigenomic 
landscape of lung SQCC and to identify potential opportunities for 
therapy. We report an integrated analysis based on DNA copy number, 
somatic exonic mutations, messenger RNA sequencing, mRNA 
expression and promoter methylation for 178 histopathologically 
reviewed lung SQCCs, in addition to whole genome sequencing 
(WGS) of 19 samples and microRNA sequencing of 159 samples 
(Supplementary Table 1.1). Demographic and clinical data and results 
of the genomic analyses can be downloaded from the TCGA data 
portal (https://tcga-data.nci.nih.gov/docs/publications/lusc_2012/). 


Samples and clinical data 
Tumour samples were obtained from 178 patients with previously 
untreated stage I-IV lung SQCC. Germline DNA was obtained from 


adjacent, histologically normal tissues resected at the time of surgery 
(n = 137) or from peripheral blood (n = 41). All patients provided 
written informed consent to conduct genomic studies in accordance 
with local Institutional Review Boards. The demographic characteris- 
tics are described in Supplementary Table 1.2. The median follow-up 
for the cohort was 15.8 months, and 60% of patients were alive at the 
time of the last follow-up (data updated in November 2011). Ninety-six 
per cent of the patients had a history of tobacco use, similar to previous 
reports for North American patients with lung SQCC'’. DNA and 
RNA were extracted from patient specimens and measured by several 
genomic assays, which included standard quality-control assessments 
(Supplementary Methods, sections 2-8). A committee of experts in 
lung cancer pathology performed a further review of all samples to 
confirm the histological subtype (Supplementary Fig. 1.1 and 
Supplementary Methods, section 1). 


Somatic DNA alterations 


The lung SQCCs analysed in this study display a large number and variety 
of DNA alterations, with a mean of 360 exonic mutations, 323 altered 
copy number segments and 165 genomic rearrangements per tumour. 

Copy number alterations were analysed using several platforms. 
Analysis of single nucleotide polymorphism (SNP) 6.0 array data 
across the set of 178 lung SQCCs identified a high rate of copy number 
alteration (mean of 323 segments) when compared with other TCGA 
projects (as of 1 February 2012), including ovarian cancer (477 
segments)’’, glioblastoma multiforme (282 segments)'’, colorectal 
carcinoma (213 segments), breast carcinoma (282 segments) and renal 
cell carcinoma (156 segments) (P< 1 x 10° by Fisher’s exact test). 
These segments gave rise to regions of both focal and broad somatic 
copy number alterations (SCNAs), with a mean of 47 focal and 23 
broad events per tumour (broad events defined as =50% of the length 
of the chromosome arm). There was strong concordance between the 
three independent copy number assays for all regions of SCNA 
(Supplementary Figs 2.1-2.4). 

At the level of whole chromosome arm SCNAs, lung SQCCs exhibit 
many similarities to 205 cases of lung adenocarcinoma analysed by 


“Lists of participants and their affiliations appear at the end of the paper. 
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TCGA (Supplementary Fig. 2.la). The most notable difference 
between these cancers is selective amplification of chromosome 3q 
in lung SQCC, as has been reported”'’. Using the SNP 6.0 array 
platform and GISTIC 2.0 (refs 20, 21), we identified regions of sig- 
nificant copy number alteration (Supplementary Methods, section 2). 
There were 50 peaks of significant amplification or deletion 
(Q< 0.05), several of which included SCNAs previously seen in lung 
SQCCs including SOX2, PDGFRA and/or KIT, EGFR, FGFR1 and/or 
WHSCIL1, CCND1 and CDKN2A*’*” (Supplementary Fig. 2.1b and 
Supplementary Data 2.1 and 2.2). Other peaks defined regions of SCNA 
reported for the first time, including amplifications of chromosomal 
segments containing NFE2L2, MYC, CDK6, MDM2, BCL2L1 and EYS 
and deletions of FOXP1, PTEN and NF1 (Supplementary Fig. 2.1b). 

Whole exome sequencing of 178 lung SQCCs and matched germline 
DNA targeted 193,094 exons from 18,863 genes. The mean sequencing 
coverage across targeted bases was 121, with 83% of target bases above 
30X coverage. We identified a total of 48,690 non-silent mutations with 
a mean of 228 non-silent and 360 total exonic mutations per tumour, 
corresponding to a mean somatic mutation rate of 8.1 mutations per 
megabase (Mb) and median of 8.4 per Mb. That rate is higher than rates 
observed in other TCGA projects including acute myelogenous leuk- 
aemia (0.56 per Mb), breast carcinoma (1.0 per Mb), ovarian cancer’” 
(2.1 per Mb), glioblastoma multiforme’* (2.3 per Mb) and colorectal 
carcinoma (3.2 per Mb) (data as of 1 February 2012, P< 2.2 X 10 re by 
t-test or Wilcoxon’s rank sum test for lung SQCC versus all others). In 
lung SQCC, CpG transitions and transversions were the most com- 
monly observed mutation types, with mean rates of 9.9 and 10.7 per 
sequenced megabase of CpG context, respectively, for a total mutation 
rate of 20.6 per Mb. At non-CpG sites, transversions at C:G sites were 
more common than transitions (7.3 versus 2.9 per Mb; total = 10.2 per 
Mb) and more common than transversions or transitions at A:T sites 
(1.5 versus 1.3 per Mb; total = 2.8 per Mb). 

Significantly mutated genes were identified using a modified version 
of the MutSig algorithm (Supplementary Methods, section 3)**”*. We 
identified 10 genes with a false discovery rate (FDR) Q value < 0.1 
(Supplementary Table 3.1): TP53, CDKN2A, PTEN, PIK3CA, KEAP1, 
MLL2, HLA-A, NFE2L2, NOTCH 1 and RB1, all of which demonstrated 
robust evidence of gene expression as defined by reads per kilobase of 
exon model per million mapped reads (RPKM) > 1 (Fig. 1). TP53 
mutation was observed in 81% of samples by automated analysis; visual 
review of sequencing reads identified a further 9% of samples with 
potential mutations in regions of sub-optimal coverage or in samples 
with low purity. Most observed mutations in NOTCH1 (8 out of 17) 
were truncating alterations, suggesting loss-of-function, as has recently 
been reported for head and neck SQCCs””*. Mutations in HLA-A were 
also almost exclusively nonsense or splice site events (7 out of 8). 

To increase our statistical power to detect mutated genes in the 
setting of the observed high background mutation rate, we performed 
a secondary MutSig analysis only considering genes previously 
observed to be mutated in cancer according to the COSMIC database. 
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This yielded 12 other genes with FDR < 0.1: FAM123B (also known as 
WTX), HRAS, FBXW7, SMARCA4, NF1, SMAD4, EGFR, APC, TSC1, 
BRAF, TNFAIP3 and CREBBP (Supplementary Table 3.1). Both the 
spectrum and the frequency of EGFR mutations differed from those 
seen in lung adenocarcinomas. The two most common alterations in 
lung adenocarcinoma, Leu858Arg and inframe deletions in exon 19, 
were absent, whereas two Leu861Gln mutations were detected in EGFR. 

As described in Supplementary Fig. 3.1, we verified somatic muta- 
tions by performing an independent hybrid-recapture of 76 genes in 
all samples. A total of 1,289 mutations were assayed, and we achieved 
satisfactory coverage to have power to verify at 1,283 positions. We 
validated 1,235 mutations (96.2%) (Supplementary Fig. 3.1 and Sup- 
plementary Methods, section 3). We also verified mutation calls using 
WGS and RNA sequencing data with similar results (Supplementary 
Figs 3.1, 4.3 and Supplementary Methods, sections 3 and 4). 

WGS was performed for 19 tumour/normal pairs with a mean 
computed coverage of 54. A mean of 165 somatic rearrangements 
was found per lung SQCC tumour pair (Supplementary Fig. 3.2), a 
value in excess of that reported for WGS studies of other tumour types 
including colorectal carcinoma (75)”*, prostate carcinoma (108)’°, 
multiple myeloma (21)” and breast cancer (90)*’. Although most 
inframe coding fusions detected in WGS were validated by RNA 
sequencing, no recurrent rearrangements predicted to generate fusion 
proteins were identified (Supplementary Data 3.1 and 4.1). 


Somatically altered pathways 


Many of the somatic alterations we have identified in lung SQCCs 
seem to be drivers of pathways important to the initiation or progres- 
sion of the cancer. Specifically, genes involved in the oxidative stress 
response and squamous differentiation were frequently altered by 
mutation or SCNA. We observed mutations and copy number altera- 
tions of NFE2L2 and KEAP1 and/or deletion or mutation of CUL3 in 
34% of cases (Fig. 2). NFE2L2 and KEAP1 code for proteins that bind 
to each other, have been shown to regulate the cell response to oxid- 
ative damage, chemo- and radiotherapy, and are somatically altered in 
a variety of cancer types**’’. We found mutations in NFE2L2 almost 
exclusively in one of two KEAP1 interaction motifs, DLG or ETGE. 
Mutations in KEAP1 and CUL3 showed a pattern consistent with loss- 
of-function and were mutually exclusive with mutations in NFE2L2 
(Figs 1c and 2). PARADIGM SHIFT” analysis predicts that muta- 
tions in NFE2L2 and KEAPI exert a considerable functional effect (Sup- 
plementary Fig. 7.C.1, 7.C.2 and Supplementary Methods, section 7). 
We also found alterations in genes with known roles in squamous 
cell differentiation in 44% of samples, including overexpression and 
amplification of SOX2 and TP63, loss-of-function mutations in 
NOTCH1, NOTCH2 and ASCL4 and focal deletions in FOXP1 
(Fig. 2). Although NOTCH has been well characterized as an onco- 
gene in haematological cancers*, NOTCH1 and NOTCH2 truncating 
mutations have been reported in cutaneous SQCCs and lung SQCCs**. 
Truncating mutations in ASCL4 are the first to be reported in human 


Figure 1 | Significantly mutated genes in lung 
SQCC. Significantly mutated genes 

(Qvalue < 0.1) identified by exome sequencing are 
listed vertically by Q value. The percentage of lung 
SQCC samples with a mutation detected by 
automated calling is noted at the left. Samples 
displayed as columns, with the overall number of 
mutations plotted at the top, and samples are 
arranged to emphasize mutual exclusivity among 
mutations. Syn., synonymous. 
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cancer and may have a lineage role given the requirement for ASCL1 
for survival of small-cell lung cancer cells**. Alterations in NOTCH1, 
NOTCH2 and ASCL4 were mutually exclusive and exhibited minimal 
overlap with amplification of TP63 and/or SOX2 (Fig. 2), suggesting 
that aberrations in those modulators of squamous cell differentiation 
have overlapping functional consequences. 


mRNA expression profiling and subtype classification 

Whole-transcriptome expression profiles were generated by RNA 
sequencing for the entire cohort and by microarrays for a 121-sample 
subset. Of 20,502 genes analysed, the mean RNA coverage indices were 
19X and 6,420 RPKM (Supplementary Fig. 4.1 and Supplementary 
Methods, section 4). Previously reported lung SQCC gene expression- 
subtype signatures* were applied to both of the expression platforms, 
yielding four subtypes designated as classical (36%), basal (25%), 
secretory (24%) and primitive (15%). The concordance of subtypes 
between the two platforms was high (94% agreement) (Supplemen- 
tary Fig. 4.2). Considerable correlations were found between the 
expression subtypes and genomic alterations in copy number, mutation 
and methylation (Fig. 3). The classical subtype was characterized by 
alterations in KEAP1, NFE2L2 and PTEN, as well as pronounced hyper- 
methylation and chromosomal instability. The 3q26 amplicon was pre- 
sent in all of the subtypes, but it was most characteristic of the classical 
subtype, which also showed the greatest overexpression of three known 
oncogenes on 3q: SOX2, TP63 and PIK3CA. RNA sequencing data 
suggested that high expression levels of TP63, in samples with and 
without amplification of TP63, were associated with dominant expres- 
sion of the deltaN isoform (also called p40), which lacks the amino- 
terminal transactivation domain, compared with the longer isoform, 
called tap63 (89% of tumours overexpressed deltaN compared with 
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tap63; P<2.2 x 10° '°). The short deltaN isoform is thought to func- 
tion as an oncogene**”*, and its expression was most enriched in the 
classical subtype. By contrast, the primitive expression subtype more 
commonly exhibited RB1 and PTEN alterations, and the basal express- 
ion subtype showed NF alterations (Fig. 3). Amplification of FGFRI 
and WHSCIL1 was anticorrelated with the classical subtype and spe- 
cifically with NFE2L2 or KEAP1 mutated samples. Although CDKN2A 
alterations are common in lung SQCCs, they are not associated with any 
particular expression subtype (Fig. 3). 

Independent clustering of miRNA and methylation data indicated 
association with expression subtypes. The highest overall methylation 
was seen in the classical subtype (Fig. 3, Supplementary Figs 5.1 and 
6.1, Supplementary Methods, sections 5 and 6, Supplementary Data 
6.1 and 6.2 and Supplementary Table 5.1). Integrative clustering 
(iCluster)*” of mRNA, miRNA, methylation, SCNA and mutation 
data demonstrated concordance with the mRNA expression subtypes 
and associated alterations (Fig. 3, Supplementary Fig. 7.A.1 and 
Supplementary Methods, section 7). Independent correlation of 
somatic mutations, copy number alterations and gene expression 
signatures revealed notable subtype associations with alterations in 
the TP53, PI3K, RB1 and NFE2L2/KEAP1 pathways (Supplementary 
Fig. 7.B.1 and Supplementary Methods, section 7). 


Analysis of the CDKN2A locus 


Integrated multiplatform analyses showed that CDKN2A, a known 
tumour suppressor gene in lung SQCC** that encodes the p16"\“** 
and p14“** proteins, is inactivated in 72% of cases of lung SQCC 
(Fig. 4a and Supplementary Data 7.1)—by epigenetic silencing by 
methylation (21%), inactivating mutation (18%), exon 18 skipping 
(4%) and homozygous deletion (29%). 


Figure 3 | Gene expression subtypes integrated 
with genomic alterations. Tumours are displayed 
as columns, grouped by gene expression subtype. 
Subtypes were compared by Kruskal-Wallis tests for 
continuous features and by Fisher’s exact tests for 
categorical features. Displayed features showed 
significant association with gene expression subtype 
(P< 0.05), except for CDKN2A alterations. deltaN 
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Analysis of mRNA expression across the CDKN2A locus revealed 
four distinct patterns of expression: complete absence of both p16INK4 
and ARF (33%); expression of high levels of both pl6INK4 and ARF 
(31%); high expression of ARF and absence of p16INK4 (31%); or 
expression of a transcript that represents a splicing of exon 18 from 
ARF with the shared exon 3 of ARF and p16INK4, generating a pre- 
mature stop codon (4%) (Supplementary Fig. 4.4). Almost all of the 
cases completely lacking p16INK4 and ARF expression showed homo- 
zygous deletion (Fig. 4b and Supplementary Data 7.1). In one case, 
p16INK4 expression was detected but analysis of WGS data demon- 
strated an intergenic fusion event that resulted in detectable transcrip- 
tion between exon 1% p16INK4 and exon 18 of KIAA1797 (Fig. 4b, c). 
Interestingly, combined analysis of WGS and RNA sequencing data 
identified tumour suppressor gene inactivation by intra- or interchro- 
mosomal rearrangement in PTEN, NOTCH1, ARIDIA, CTINNA2, VHL 
and NFI, in eight further cases (Supplementary Data 3.1 and 4.1). 

In addition to homozygous deletion, there are frequent mutational 
events in CDKN2A (Fig. 4b and Supplementary Data 7.1). These 
account for 45% of the 56 cases with high p16INK4 and ARF expres- 
sion. Furthermore, methylation of the exon 1% promoter accounts for 
many other cases of CDKN2A inactivation (70% of lung SQCCs with 
ARF expression in the absence of detectable p16INK4). Seven other 
tumours in the high-ARF/low-INK4A group had documented 
mutations of INK4A, primarily nonsense mutations, suggesting 
nonsense-mediated decay as a mechanism. Of the 28% of tumours 
without CDKN2A alterations, RBI mutations were identified in eight 
cases and CDK6 amplification in one case (Fig. 4d). 


Therapeutic targets 
Molecularly targeted agents are now commonly used in patients with 
adenocarcinoma of the lung, whereas no effective targeted agents have 
been developed specifically for lung SQCCs’’. We analysed our genomic 
data for evidence of the two common genomic alterations in adeno- 
carcinomas of the lung: EGFR and KRAS mutations. Only one 
sample had a KRAS codon 61 mutation, and there were no exon 19 
deletions or Leu858Arg mutations in EGFR. However, amplifications 
of EGFR were found in 7% of cases, as were two instances of the 
Leu861Gln EGFR mutation, which confers sensitivity to erlotinib 
and gefitinib”’. 

The presence of new potential therapeutic targets in lung SQCC 
was suggested by the observation that 96% (171 out of 178) of tumours 
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contain one or more mutations in tyrosine kinases, serine/threonine 
kinases, phosphatidylinositol-3-OH kinase (PI(3)K) catalytic and 
regulatory subunits, nuclear hormone receptors, G-protein-coupled 
receptors, proteases and tyrosine phosphatases (Supplementary 
Fig. 7.D.1a and Supplementary Data 7.2 and 7.3). From 50 to 77% 
of the mutations were predicted to have a medium or high functional 
effect as determined by the mutation assessor score*’ (Supplementary 
Fig. 7.D.1a), and 39% of tyrosine and 42% of serine/threonine kinase 
mutations were located in the kinase domain. Many of the alterations 
were in known oncogenes and tumour suppressors, as defined in the 
COSMIC database (Supplementary Data 7.3). 

We selected potential therapeutic targets based on several features, 
including (1) availability of a US Food and Drug Administration 
(FDA)-approved targeted therapeutic agent or one under study in 
current clinical trials (Supplementary Data 7.2); (2) confirmation of 
the altered allele in RNA sequencing; and (3) the mutation assessor 
score”. Using those criteria, we identified 114 cases with somatic 
alteration of a potentially targetable gene (64%) (Supplementary 
Fig. 7.D.1b and Supplementary Data 7.4). Among these, we identified 
three families of tyrosine kinases, the erythroblastic leukaemia viral 
oncogene homologues (ERBBs), fibroblast growth factor receptors 
(FGFRs) and Janus kinases (JAKs), all of which were found to be 
mutated and/or amplified*’. As discussed for EGFR, the mutational 
spectra in these potential therapeutic targets differed from those in 
lung adenocarcinoma (Supplementary Fig. 7.D.2)”. 

To complement a gene-centred search for potential therapeutic 
targets, we analysed core cellular pathways known to represent poten- 
tial therapeutic vulnerabilities: PI(3)K/AKT, receptor tyrosine kinase 
(RTK) and RAS. Analysis of the 178 lung SQCCs revealed alteration 
in at least one of those pathways in 69% of samples after restriction of 
the analysis to mutations confirmed by RNA sequencing and to 
amplifications associated with overexpression of the target gene 
(Fig. 5). Mutational events that have been curated in COSMIC are 
also shown in Supplementary Fig. 7D.2, as is the distribution of muta- 
tions, amplifications and overexpression of the genes depicted in 
Fig. 5. (A summary of all samples and their significant mutations 
and copy number alterations, including alterations in Fig. 5, is shown 
in Supplementary Data 7.5.) Specifically, one of the components of the 
PI(3)K/AKT pathway was altered in 47% of tumours and RTK sig- 
nalling probably affected by events such as EGFR amplification, BRAF 
mutation or FGFR amplification or mutation in 26% of tumours 
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Figure 5 | Alterations in targetable oncogenic pathways in lung SQCCs. 
Pathway diagram showing the percentage of samples with alterations in the 
PI(3)K/RTK/RAS pathways. Alterations are defined by somatic mutations, 
homozygous deletions, high-level, focal amplifications, and, in some cases, by 
significant up- or downregulation of gene expression (AKT3, FGFRI, PTEN). 
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(Fig. 5 and Supplementary Fig. 7.D.3). Alterations in the PI(3)K/AKT 
pathway genes were mutually exclusive with EGFR alterations as 
identified by MEMo* (Supplementary Fig. 7.D.4.). Although the 
dependence of lung SQCC on many of these individual alterations 
remains to be defined functionally, this analysis suggests new areas for 
potential therapeutic development in this cancer. 


Discussion 


Lung SQCCs are characterized by a high overall mutation rate of 8.1 
mutations per megabase and marked genomic complexity. Similar to 
high-grade serous ovarian carcinoma’, almost all lung SQCCs 
display somatic mutation of TP53. There were also frequent altera- 
tions in the following pathways: CDKN2A/RB1, NFE2L2/KEAP1/ 
CUL3, PI3K/AKT and SOX2/TP63/NOTCH1 pathways, providing 
evidence of common dysfunction in cell cycle control, response to 
oxidative stress, apoptotic signalling and/or squamous cell differenti- 
ation. Pathway alterations clustered according to expression-subtype 
in many cases, suggesting that those subtypes have a biological basis. 

A role for somatic mutation in the cancer hallmark of avoiding 
immune destruction* is suggested by the presence of inactivating muta- 
tions in the HLA-A gene. Somatic loss-of-function alterations of HLA-A 
have not been reported previously in genomic studies of lung cancer. 
Given the recently reported efficacy of anti-programmed death 1 
(PD1)* and anti-cytotoxic T-lymphocyte antigen 4 (CTLA4) antibodies 
in non-small-cell lung cancer*®, these HLA-A mutations suggest a pos- 
sible role for genotypic selection of patients for immunotherapies. 

Targeted kinase inhibitors have been successfully used for the treat- 
ment of lung adenocarcinoma but minimally so in lung SQCC. The 
observations reported here suggest that a detailed understanding of 
the possible targets in lung SQCCs may identify targeted therapeutic 
approaches. Whereas EGFR and KRAS mutations, the two most com- 
mon oncogenic aberrations in lung adenocarcinoma, are extremely 
rare in lung SQCC, alterations in the FGFR kinase family are 
common. Lung SQCCs also share many alterations in common with 
head and neck squamous cell carcinomas without evidence of human 
papilloma virus infection, including mutation in PIK3CA, PTEN, 
TP53, CDKN2A, NOTCH1 and HRAS”, suggesting that the biology 
of these two diseases may be similar. 

The current study has identified a potentially targetable gene or 
pathway alteration in most lung SQCC samples studied. The data 
presented here can help to organize efforts to analyse lung SQCC 
clinical tumour specimens for a panel of specific, actionable muta- 
tions to select patients for appropriately targeted clinical trials. These 
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data could thereby help to facilitate effective personalized therapy for 
this deadly disease. 


METHODS SUMMARY 


All specimens were obtained from patients with appropriate consent from the 
relevant Institutional Review Board. DNA and RNA were collected from samples 
using the Allprep kit (Qiagen). We used commercial technology for capture and 
sequencing of exomes from tumour DNA and normal DNA and whole-genome 
shotgun sequencing. Significantly mutated genes were identified by comparing them 
with expectation models based on the exact measured rates of specific sequence 
lesions. GISTIC”** analysis of the circular-binary-segmented Affymetrix SNP 6.0 
copy number data was used to identify recurrent amplification and deletion peaks. 
Consensus clustering approaches were used to analyse mRNA, miRNA and methy- 
lation subtypes using previous approaches”??***784), 
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Cross-neutralization of influenza A 
viruses mediated by a single antibody loop 
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Immune recognition of protein antigens relies on the combined interaction of multiple antibody loops, which provide a 
fairly large footprint and constrain the size and shape of protein surfaces that can be targeted. Single protein loops can 
mediate extremely high-affinity binding, but it is unclear whether such a mechanism is available to antibodies. Here we 
report the isolation and characterization of an antibody called C05, which neutralizes strains from multiple subtypes of 
influenza A virus, including H1, H2 and H3. X-ray and electron microscopy structures show that C05 recognizes 
conserved elements of the receptor-binding site on the haemagglutinin surface glycoprotein. Recognition of the 
haemagglutinin receptor-binding site is dominated by a single heavy-chain complementarity-determining region 3 
loop, with minor contacts from heavy-chain complementarity-determining region 1, and is sufficient to achieve 
nanomolar binding with a minimal footprint. Thus, binding predominantly with a single loop can allow antibodies to 


target small, conserved functional sites on otherwise hypervariable antigens. 


Antibody recognition of protein antigens is predominantly mediated 
by four to six complementarity-determining regions (CDRs), which 
are the variable loops at the tip of each Fab. The relatively large 
footprint of antibodies on their target antigens (~700-900 A* for 
proteins) generally correlates with high-affinity binding. However, 
for neutralizing antibodies against variable pathogens, a larger foot- 
print may result in increased opportunity for escape mutations that 
reduce antibody binding. Moreover, functionally conserved sites on 
otherwise variable antigens may be small, as for the influenza virus 
receptor-binding site; partially protected by glycans, as with both the 
HIV-1 glycoprotein gp120 and influenza; or are sterically restricted 
and difficult for antibodies to access, such as the picornavirus 
‘canyon”’ or the gp120 co-receptor-binding site on HIV-1 (ref. 2). 
By contrast, non-immunoglobulin proteins can also achieve high- 
affinity binding in other ways that may have distinct advantages for 
targeting small, constrained surfaces. For example, bovine pancreatic 
trypsin inhibitor (BPTI) attains exceptionally high-affinity binding 
(Kg at {M levels) to serine proteases by inserting a single loop into 
the enzyme active site’. In light of the increasingly long heavy-chain 
CDR3 (HCDR3) loops being found in humans*°, a BPTI-like binding 
mechanism may be structurally accessible to antibodies, allowing 
insertion of a single loop into a pocket. However, no clear example 
of an antibody using such a binding mechanism has been reported. 
Two conserved and functionally important sites on the haemagglu- 
tinin (HA) stem have been targeted by antibodies previously, includ- 
ing epitopes recognized by the broadly neutralizing antibodies A06 
(refs 7 and 8), CR6261 (refs 9 and 10), F10 (ref. 11), CR8020 (ref. 12) 
and FI6 (ref. 13). Several recent studies have suggested that stem 
antibodies may be present in a greater number of individuals than 


previously thought'*”’, and the ability to re-elicit stem antibodies by 
immunization”’ has raised hopes that a universal vaccine for influenza 
A may be achievable. Although such antibodies against the stem are 
highly cross-reactive, most antibodies that target the more variable 
receptor-binding domain (RBD) of the HA1 subunit show limited 
breadth of neutralization. Crystal structures show that, although a 
few antibody footprints on the HAI RBD sometimes coincide with 
the receptor-binding site'’*°, many of the essential interactions are 
made with hypervariable regions well outside of the functionally con- 
served region involved in sialic acid recognition’’*. However, recent 
work suggests that some rare antibodies against the HA1 RBD can 
achieve modest cross-reactivity***. Thus, identification and struc- 
tural understanding of heterosubtypic antibodies against the RBD 
with broad activity, particularly against human pandemic viruses 
(influenza H1, H2 and H3 subtypes), would be a major advance 
and facilitate development of new therapeutics complementary to 
those targeting the stem. Here we report the functional and structural 
characterization of one such antibody, C05, which neutralizes mul- 
tiple subtypes by inserting an extended CDR loop into the receptor- 
binding pocket. 


Isolation and characterization of C05 


Previously, we isolated antibodies from phage-displayed combinatorial 
libraries derived from Turkish patients who survived H5N1 avian flu 
infection’* and identified a new class of antibodies effective against a 
broad range of group 1 influenza A viruses. Here we used a similar 
approach to identify antibodies that would neutralize both group 1 and 
group 2 viruses. Phage libraries constructed from the immune reper- 
toires of seasonal influenza infection survivors were doubly selected 
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Figure 1 | C05 neutralizes multiple influenza virus subtypes from groups 1 
and 2. Phylogenetic tree with the two main viral lineages indicated; group 1 is 
circled in blue and group 2 is circled in green. Strains from subtypes circled in 
red are bound or neutralized by C05. 


against HA proteins from H1 (group 1) and H3 (group 2). This process 
yielded a limited number of clones that reacted broadly with both H1 
and H3 HA proteins. One clone, C05, which uses the Vxj3-23 and Vk 1- 
33*01 heavy- and light-chain V genes, respectively, has two distinctive 
structural features: a long 24-amino-acid HCDR3 (Supplementary 
Fig. 1) and a five-residue somatic insertion in HCDR1. C05 potently 
neutralizes viruses from H1, H2, H3 and H9 subtypes in vitro (Fig. 1 and 
Table 1a), including the influenza A subtypes that have caused human 
pandemics. However, C05 had no detectable activity against the H5 
subtype, as well as some of the H1 viruses tested. In sharp contrast to 
most broadly neutralizing antibodies to influenza’''”, which bind the 
more conserved stem region, C05 inhibits receptor binding and hae- 
magglutination (Table 1a), indicating recognition of an epitope in the 
HAI globular head domain. 


Protection in vivo 


To explore the in vivo efficacy of C05, we examined its ability to 
protect mice from lethal challenge with HIN1 or H3N2 viruses. 
When administered prophylactically 24h before challenge with the 
A/Memphis/3/2008 (H1N1) virus, 1mgkg ' C05 IgG protected 
100% of mice from death and prevented substantial weight loss 
(Fig. 2a). Lower doses (0.025-0.25 mg kg” ') still afforded propor- 
tional protection (Supplementary Fig. 2a). For the A/X-31/1968 
(H3N2) virus, a 10mgkg | dose protected 100% of the mice from 
death (Fig. 2b), and both 0.25 and 1.0 mgkg * doses afforded protec- 
tion to the majority (Supplementary Fig. 2b). 

Next we determined the therapeutic potential of C05 against an 
established HIN1 or H3N2 infection. A single 15 mgkg * IgG dose 
was administered at 1, 2, 3, 4 or 5 days after lethal challenge with the 
aforementioned H1N1 or H3N2 viruses. In both instances, mice 
treated up to 3 days after infection were completely protected 
(Fig. 2c, d). We then evaluated the minimal dose needed to overcome 
established infection 3 days after inoculation, the latest time at which 
C05 treatment was completely protective against death. As little as a 
single 3mgkg ' dose of antibody prevented infection-related death 
in 80% of the animals (Supplementary Fig. 2c). These in vivo results 
indicate the utility of C05 in both prevention and treatment. 


Avidity is critical for C05 function 

To further profile its breadth of activity, we tested C05 Fab binding to a 
large panel of HAs by biolayer interferometry (BLI), including repres- 
entative members of most of the 16 influenza subtypes. C05 Fab binds to 
a number of diverse HAs spanning both group 1 and group 2 viruses, 
including members of the H1, H2, H3, H9 and H12 subtypes (Table 1b). 
Notably, many HAs from viruses neutralized by IgG C05 are bound with 
only intermediate-to-low affinity by the Fab (Ka ~ 100nM-10 uM), 
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Table 1 | CO5 binds and neutralizes multiple group 1 and group 2 
influenza A viruses 


a C05 neutralizes group 1 and group 2 influenza viruses in vitro 


Group Subtype Strain MIC (ug mi~?) 
1 H1N1 A/Virginia/670/1987 >100* 
A/Texas/36/1991 >100* 
A/New Caledonia/20/1999 <0.1* 
A/Solomon Islands/2006 1.56* 
A/Brisbane/59/2007 1.56* 
A/Memphis/3/2008 <2.3* 
A/California/04/2009 >100* 
H2N2 A/Guiyang/1/1957 184 
H5N1 A/Vietnam/1203/2004 >100* 
HON2 A/Hong Kong/1073/1999 90+ 
2 H3N2 A/X-31/1968 <0.1* 
A/Panama/2007/1999 0.39* 
A/Wisconsin/67/2005 <0.1* 
A/Brisbane/10/2007 12.5* 


b CO5 binding to group 1 and group 2 influenza HAs 


Group Subtype Strain CO5 CR6261 CR8020 
il H1N1 A/South Carolina/1/1918 - +t+4++ °° - 
1 H1N1_ A/New York/1/1918 - ++4+4+  °- 
1 H1IN1 A/AA/Marton/1943 = ++4+4+ °° - 
1 H1N1 A/USSR/90/1977 = +4+4+4+ - 
1 H1N1_ A/Singapore/6/1986 + +444 °° = 
il H1N1 A/Texas/36/1991 = +4+4+4+ °° - 
il H1N1 A/Beijing/262/1995 +++ 4¢4+4+4+ - 
cl H1N1_ A/Solomon Islands/3/2006 +44+ 4444+ °° - 
1 H2N2 A/Japan/305/1957 +++ ++ - 
1 H2N2_ A/Adachi/2/1957 +++ ++ - 
1 H5N1_ A/Vietnam/1203/2004 - ++4++ °° - 
1 H5N1_ A/Indonesia/05/2005 _ ++4+4+ °° - 
i H6N2_ A/turkey/Massachusetts/3740/1965 - +444 °° - 

i HON2 A/turkey/Wisconsin/1/1966 +44+ 4444+ °° - 
1 H12N5 A/duck/Alberta/60/1976 + - - 
1 H13N6 A/gull/Maryland/704/1977 - ++4++ °° - 
1 H16N3 A/black-headed gull/Sweden/4/99 - +++ - 
2 H3N2_ A/duck/Ukraine/1/1963 - = +444 
2 H3N2_ A/Hong Kong/1/1968 ++ = +444 
2 H3N2_ A/Bangkok/1/1979 - — +444 
2 H3N2_ A/Victoria/3/1975 - — +444 
2 H3N2_ A/Beijing/353/1989 - — +444 
2 H3N2_ A/Shangdong/9/1993 - — +444 
2 H3N2 A/Panama/2007/1999 ++ -—  +4+4++4+ 
2 H3N2 A/Moscow/10/1999 ++ — +444 
2 H3N2_ A/Brisbane/10/2007 + — +444 
2 H3N2_ A/Perth/16/2009 ++++ — +444 
2 H4N6 A/duck/Czechoslovakia/1956 - - ++ 
2 H7N7_ A/Netherlands/219/2003 - —— 
2  H10ON7 A/chicken/Germany/N/1949 - — +444 
2 H14N5 A/mallard/Astrakhan/263/1982 - - ++ 
2 H15N8 A/shearw./W.Australia/2576/1979 - - +++ 
c CO5 shows unusually low affinity for some HAs from neutralized viruses 
Group Subtype Strain C05 Kg (nM) MIC (ug mi~?) 
BLI ITC 
2 H3N2_ A/Perth/16/2009 18 26 ND 
A/Hong Kong/1/1968 430 100 <0.1 
A/Panama/2007/1999 720 225 0.4 
A/Brisbane/10/2007 >5,000 ~10,000 12.5 


Dissociation constants: +Ky = 5,000 nM; ++Kg = 500-5,000 nM; +++Ky = 50-500 nM; 
++++Ky=50nM. 

MIC, minimum inhibitory concentration; ND, not determined. 

* Haemagglutination inhibition. 

+ Microneutralization. 


well outside the range typically required for effective neutralization, such 
as by stem antibodies (Ky < 100 nM)’°”’. To confirm these unexpected 
results, we selected four HAs from neutralized viruses that bound C05 
Fab with a range of affinities (Kg = 18 nM, 430nM, 720 nM, and one 
with no detectable Fab binding by BLI) and assessed C05 binding in 
solution by isothermal titration calorimetry (ITC). The ITC results were 
in good agreement with the Ky measured by BLI (Table 1c). Notably, 
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Figure 2 | C05 protects mice from lethal virus challenge. a, b, Survival and 
weight loss were monitored in response to varying amounts of C05 IgG 

administered prophylactically to mice 24h before challenge with 25x the 50% 
mouse lethal dose (MLDso) of A/Memphis/3/2008 (H1N1) (a) or 33% MLDso 


although only very weak binding (Ky;~10uM) of C05 Fab to 
A/Brisbane/10/2007 (H3N2) HA was detected by ITC (undetectable 
by BLI), the corresponding virus was neutralized by C05 IgG in vitro 
(Table 1c). Thus, despite the unusually low affinity of C05 Fab for many 
of its targets, CO5 IgG potently neutralizes virus replication in vitro and 
is protective in vivo. To the best of our knowledge, only one other 
neutralizing antibody has been reported with such a considerable dis- 
crepancy between binding affinity and neutralization. The broadly 
neutralizing anti-HIV antibody 2G12 binds a high-mannose glycan 
cluster on the gp120 subunit of the HIV surface spike, but its affinity 
for a single high-mannose glycan is very low (Kg at [1M levels)’*. 
However, a highly unusual domain-swapping of the Fabs allows 
2G12 to bind multiple glycans simultaneously and thereby achieve 
high-affinity binding (nM levels) through avidity”’”. For C05, the pres- 
ence of multiple antibody-binding sites on each trimeric HA spike and 
the high density of HA on the virus surface may facilitate simultaneous 
engagement of both arms of the C05 IgG, resulting in much higher 
avidity compared with Fab. Electron microscopy experiments with H1, 
H2 and H3 HAs show that three C05 Fabs are bound per trimer 
(Fig. 3f), but their geometry is inconsistent with bivalent attachment 
by an IgG antibody. Indeed, C05 IgG binds to an array of immobilized 
HA with much higher affinity (nM to ~pM levels) than its Fab, con- 
sistent with the single-particle electron microscopy analysis confirming 
the ability of IgG to cross-link HA spikes (Supplementary Fig. 3). Thus, 


528 | NATURE | VOL 489 | 27 SEPTEMBER 2012 


120 
110 
100 


+ 25mg kg" 
~@ 10mg kg 
-& 2.5mg kg" 
—@& 11mg kg! 
> PBS 


90 
80 
70 


60 
101234567 8 91011121314 


% day 1 weight 


Days post-infection 


120 & 25 mg kg" 

@ 10 mg kg" 

-t 2.5 mg kg" 

“@ 25 mg kg" control 
> PBS 


110 
100 
90 
80 
70 


% day 1 weight 


10 
101234567 8 91011121314 
Time (days) 


110 
+ 1 DPI 


© 2DPI 
-# 3DPI 
-—@ 4DPI 
© 5DPI 
-e 1 DPI-PBS 


100 


90 


80 


% day 0 weight 


70 


60 
012345 67 8 910111213 14 


Time (days) 


+ 1DPI 
© 2 DPI 
5 -« 3DPl 
© = 4DPI 
© 5 DPI 
~@ 1 DPI-PBS 


% day 0 weight 


0 
012345 6 7 8 91011121314 


Time (days) 


of A/X-31/1968 (H3N2) viruses (b). ¢, d, A single therapeutic dose of 

15mgkg ' C05 IgG was delivered 1, 2, 3, 4 or 5 days post-infection (DPI) with 
25X MLDso of A/Memphis/3/2008 (c) or 33X MLDsp of A/X-31/1968 viruses 
(d). 2 =5 mice for all studies. Error bars denote + s.d. 


some antibodies with low-affinity Fab interactions can still have strong 
activity in vitro and in vivo and may represent an under-appreciated 
component of the immune repertoire. The role of avidity may be more 
or less pronounced depending on the nature of the epitope and the 
density of the antigen on the viral surface, with clustered epitopes, such 
as those on the HAI RBD, benefiting most from avidity, whereas 
sterically crowded (for example, HA stem) or sparse epitopes (HIV 
envelope spikes) probably prevent bivalent binding. In the latter case, 
high-affinity binding must be achieved by the Fab or by heteroligation 
of two different antigens, which may have a more prominent role in the 
neutralization of human viruses such as HIV”. 


Crystal structure of the CO5-HA complex 

To understand how the antibody C05 achieves heterosubtypic 
neutralization, we determined the crystal structure of C05 Fab in 
complex with the HA1 subunit from the A/Hong Kong/1/1968 
(HK68/H3) influenza strain at 2.95A resolution, and Fab C05 in 
complex with the complete, trimeric ectodomain of HK68/H3 at 
4.25A (Supplementary Table 1). In addition, high-resolution 
structures of HK68/H3 HA (1.90 A) and C05 Fab (2.30 A) provided 
high-quality starting models for refinement of the complexes. Few 
differences are apparent between the isolated HA1 and the HA1 
subunit in trimeric HA, as in other HAI crystal structures’®. 
Further, the CO5-HA1 and C05-HA trimer structures are in excellent 
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Figure 3 | C05 binds the receptor-binding site on the HAI head. a, Crystal 
structures of C05 Fab in complex with trimeric HA and an HA1 fragment reveal 
the location of the C05 epitope in the HAI ‘head’ region. The C05 Fab heavy 
(orange) and light (yellow) chains and the HA1 (blue) and HA2 (red) subunits 
are depicted as ribbons. b, C05 inserts its long HCDR3 (red) into the receptor- 
binding site. c, C05 binds HA using only its heavy chain. d, e, Interaction of the 
HA receptor-binding pocket (in a solid electrostatic surface representation) 
with HCDR3 from C05 (in yellow backbone with side chains labelled by residue 
type and number) (d) and with an «2,6 sialoglycan receptor (derived from 
Protein Data Bank entry 1MQN), with an additional carbohydrate residue 


agreement (Supplementary Fig. 4), indicating that the CO5-HA1 
complex faithfully recapitulates the native interaction. 

Three C05 Fabs bind near the membrane-distal end of the HA 
trimeric spike (Fig. 3a), and HCDR3 penetrates the receptor-binding 
site (Fig. 3b). Several notable features of the C05 interaction are 
immediately apparent, and the exceptionally long 24-residue HCDR3 
of C05 dominates the antibody-antigen interaction (Fig. 3b). Although 
~1.5% of human antibodies in the Abysis database have an HCDR3 of 
24 or more residues, only 3 of the over 1,000 Fab structures in the 
Protein Data Bank have more than 24-residue HCDR3s (Supplemen- 
tary Fig. 5). Very long CDRs can adopt unusual conformations, such as 
the ‘hammerhead’ structure in PG9 and PG16*°”, and these long loops 
are presumably crucial for antigen recognition, yet few structures of 
such antibodies bound to antigen have been reported. The C05 struc- 
ture therefore gives us a valuable glimpse at how long HCDR3s can be 
used to target recessed or evolutionarily constrained epitopes. 

The long HCDR3 of C05 extends far from the antibody surface, 
with its most distal end forming a B-hairpin. The long HCDR3 holds 
the Vj; and V; domains at a distance from the HA surface (Fig. 3c), 
preventing the light chain from contacting the epitope, reminiscent of 
the heavy-chain-dominant recognition by stem-directed V}1-69 
antibodies”"’. At a first approximation, the C05 HCDR3 resembles 
the PG9 and PG16 hammerheads, but with its shorter carboxy- 
terminal branch truncated (Supplementary Fig. 6). The HCDR3 
B-hairpin inserts into the shallow HA receptor-binding pocket near 
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modelled in grey (e). The binding site for the C05 H3 loop and the glycan 
receptor overlap extensively. f, Three-dimensional electron microscopy 
reconstructions of C05 Fab (red) bound, from left to right, to H1 (blue), H2 
(light blue) and H3 (cyan) HAs. The fitting of H1, H2 and H3 crystal structures 
into the reconstructions support a common binding mode of C05 to each of the 
HAs studied. g, In addition to its long HCDR3 (red), C05 has a five-residue 
insertion in HCDRI (yellow), extending the tip of this loop relative to the 
germline-encoded canonical conformation. For comparison, a sequence- 
related Fab structure (Protein Data Bank code 2VXS) without an insertion in 
HCDR1 (blue loop) is superimposed onto C05. 


the HA spike apex (Fig. 3d). HCDR3 overlaps with the binding site for 
glycan receptors, indicating that C05 inhibits virus attachment by 
direct competition with sialic acid (Fig. 3e). However, C05 interaction 
with the receptor-binding site differs considerably from sialic acid 
binding to HA. 

Aside from HCDR3, only HCDR1 makes additional minor inter- 
actions with HA, in part owing to a five-residue somatic insertion 
(Fig. 3g and Supplementary Fig. 1). Insertions and deletions (indels) 
within the V gene coding region are relatively rare*”*’, but recent work 
suggests that these indels may be selected during affinity maturation 
and can be important for antibody function****. Consistent with the 
minor contacts made by HCDR1, removal of the five-residue insert 
(del(FGEST)) or scanning mutagenesis of HCDR1 had no detectable 
effect on C05 binding to HK68/H3 and several other H1, H2 and H3 
HAs (Table 2 and Supplementary Table 2), whereas binding of 
C05(del(FGEST)) to the HA of H3 influenza subtype strain Perth09 
was reduced nearly 100-fold compared with wild-type C05 (Table 2), 
indicating that HCDR1 may make additional interactions with 
Perth09 (see discussion in Supplementary Information and Sup- 
plementary Fig. 7). Thus, the HCDR1 insertion is dispensable for 
binding to most HAs tested, whereas the HCDR3 loop alone is suf- 
ficient for full activity. However, nearby antibody CDRs and frame- 
work regions probably contribute to binding by supporting HCDR3, 
as the loop conformation is essentially identical in the crystal structure 
of C05 Fab alone. In particular, the tip of HCDR2 interacts with the 
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Table 2 | Effect of HCDR1 mutations on C05 binding 


Dissociation constants (Ky, nM) 


C05 variant 
Strain WT del(FGEST) F27bA G27cA E27dA T31F T31L T31A 
A/Hong Kong/1/1968 (H3N2) 500 690 610 450 540 420 1,400 640 
A/Perth/13/2009 (H3N2) 18 1,300 460 33 28 6.8 560 550 


WT, wild type. 


backside of the HCDR3 loop and harbours several somatic mutations 
that may have arisen to best position and stabilize HCDR3 for HA 
binding (Supplementary Fig. 8). By contrast, HCDRI1 seems to have a 
lesser role, as the above mutations above did not prevent C05 binding. 


Loop-insertion binding mechanism 

Despite high sequence diversity immediately surrounding the receptor- 
binding site, the functional constraints imposed by binding to sialic acid 
receptors restrict variation within the pocket itself. Thus, although 
other antibodies can bind in or near the receptor-binding site, a typical 
antibody footprint would extend well beyond the conserved functional 
region and contact at least some variable positions around the receptor- 
binding site, rendering them largely strain specific and susceptible to 
escape through antigenic drift. A notable exception is CH65, which 
cross-neutralizes an unusually large subset of HIN1 viruses, but not 
other subtypes”. However, unlike C05, CH65 binds HA using both 
heavy and light chains (including five of the six CDR loops), which 
increases contact with these more variable surfaces (Supplementary 
Fig. 9). By contrast, the long HCDR3 of C05 targets the conserved 
elements of the receptor-binding site, and enters and exits the pocket 
without contacting many of the surrounding variable positions (Fig. 4a, 
b). Indeed, we were unable to generate C05 escape mutants with an H3 
virus despite repeated attempts. 

Loop insertion is a common mechanism for achieving high-affinity 
protein-protein interactions, as notably illustrated by BPTI’. Owing 
to the restricted, single-CDR-driven interaction of C05 with HK68/ 
H3 HA, only 550 A? of surface area is buried on the HA by C05, which 
is unusually small for an antibody—protein interaction. This small 
footprint allows C05 to achieve broader cross-reactivity compared 
with other antibodies that extend their interface outside of the receptor- 
binding site (for example, ~650 A? buried surface on HA for CH65 
and 740 A® for 2D1; Supplementary Table 3). Single-particle three- 
dimensional electron microscopy reconstructions of C05 bound to 
H1, H2 and H3 HAs (Fig. 3f and Supplementary Fig. 10), along with 
binding experiments probing the effect of several mutations in the 
HCDR3 loop (Supplementary Fig. 11), clearly illustrate that binding to 
other subtypes is probably very similar to that in the H3 crystal structure. 


Epitope conservation and variation linked to escape 


The wide range of affinities for C05 against a panel of HAs indicates 
that epitope variation has a considerable effect on C05 activity. 
Therefore, we examined that sequence diversity contained within all 
full-length, non-redundant human H1, H2 and H3 HA sequences in 
the NCBI Flu database**’. An analysis including all 16 virus subtypes 
yielded similar results (Supplementary Table 4). Of 17 C05 contact 
residues on HA, 6 core residues are =99% conserved across human 
H1, H2 and H3 viruses, including 98, 134, 136, 153, 190 and 194 
(Fig. 4a, b). The remaining 11 positions are generally <50% conserved 
(ranging from 42 to 76%). Therefore, the C05 epitope is markedly less 
conserved (mean % identity/mean % conservation across all contact 
residues is 59%/64%) compared with stem epitopes recognized by 
broadly neutralizing antibodies (76%/86% for CR6261 and 71%/ 
80% for CR8020), owing to greater variation within the receptor- 
binding site. Thus, the range of observed affinities for C05 binding 
to H3 HAs may arise from multiple, subtle mutations (increasing 
Kg two- to tenfold) that in combination tune C05 affinity for HA 
and ultimately reduce binding below the threshold of detection in 
some cases. Substitutions at HA1 190, 225, 226 and 228 are important 
for determining receptor specificity in HAs adapted to particular 
hosts (for example, «2-3 versus «2-6 sialoglycans for avian versus 
human viruses, respectively), and mutations modulating receptor 
affinity are a well-documented mechanism of antibody escape”. 
C05 makes only minor contacts with Leu 226 and does not contact 
residues 225 or 228. C05 interacts more extensively with Asp 190 in 
HK68/H3, but Asp, Glu and Val are present at position 190 in our 
panel of HAs bound by C05. Thus, C05 may bind equally well to HAs 
with either «2,3 or «2,6 receptor specificity, but variability within the 
receptor-binding site may account for some of the observed hetero- 
geneity in C05 binding. 

Several common indels are close to the receptor-binding site and 
may affect antibody binding (Supplementary Fig. 12). Insertions at 
position 133a result in a local bulge that clashes directly with C05 
HCDR3 and probably abrogates binding. This insertion is observed in 
100% of all H6 and H10, 96% of all H5 and 66% of all H1 subtype 
sequences. Consistent with this observation, no H5 viruses tested were 


Figure 4 | C05 epitope conservation across influenza A viruses. a, Owing to 
the functional constraints imposed on the receptor-binding site by interaction 
with sialoglycans, many of the residues that make up the C05 epitope are 
conserved among human influenza viruses. C05 contact residues are depicted 
as sticks and the per cent conservation of each position across human H1, H2 
and H3 viruses—the subtypes that have caused pandemics and epidemics in 
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humans—is shown. b, Similar to the view in a, but includes the position of the 
C05 HCDR3 (yellow ribbon and sticks) relative to the epitope variation that is 
colour coded on the HA backbone (with warmer colours indicating higher 

conservation). Contact residues on HA are depicted with a thick tube for their 
backbone, and non-contacting regions of HA are shown as a thin white ribbon. 
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neutralized and binding to H6, H10 and multiple H5 isolates was 
undetectable, even with 10 1M IgG. Similarly, C05 binding to five 
different H1 HAs with the 133a insertion was undetectable with 
10 uM IgG, and only very weak binding was observed to a sixth 
(A/Singapore/6/1986); no viruses with the insertion were neutralized. 
By contrast, all H1 HAs without the 133a insertion were bound and/or 
neutralized by C05. Thus, the 133a insertion seems to negatively affect 
binding of C05, similar to the effect of the 133a insertion on CH65 
(ref. 20). In addition, indels near position 158 and the 220-loop 
correlate with poor activity against H4, H6, H7, H10, H14 and H16 
subtypes (Supplementary Fig. 12). These insights have enabled us to 
delineate the most important structural requirements for high- 
affinity cross-reactive binding to the receptor-binding site. 


Implications for antiviral therapeutics and vaccine design 


Because C05 binding is accomplished exclusively through the heavy 
chain, and primarily through HCDR3, its interaction with HA defines 
a minimal set of contacts sufficient for cross-group neutralization, 
which may be exploited to develop antiviral therapeutics. The ability 
of a broadly neutralizing antibody, such as C05, to penetrate the 
shallow receptor-binding site with relatively high affinity (uM to 
nM levels) and avidity (nM) compared with its natural receptor 
(mM) makes such an antibody extremely attractive for therapeutics 
and diagnostics. C05 was discovered from a single human donor 
source library using the power of phage-display library technology 
to isolate this rare antibody-binding interaction. Owing to the com- 
binatorial nature of antibody library construction, we cannot deter- 
mine explicitly whether the C05 heavy chain and light chain pairing 
previously existed in the donor. However, other receptor-binding site 
antibodies, such as CH65 (ref. 20), have been isolated from B cells, 
showing that receptor-binding site antibodies can be found naturally 
and are potentially inducible by vaccination strategies. Thus, the design 
of HA immunogens to elicit more broadly neutralizing head antibodies 
may now be more feasible and will complement the arsenal of already 
promising stem antibodies for use in a more universal vaccine, as well 
as in structure-based and computational approaches to design small 
proteins and other possible therapeutics against flu, such as already 
achieved against the stem epitope”®. 


METHODS SUMMARY 

Isolation of C05. Phage antibody libraries were cloned from the bone marrow of 
donors with confirmed influenza virus exposure. Phages were selected by sequential 
panning against A/Wisconsin/67/05 (H3N2) followed by A/New Caledonia/20/ 
1999 (HIN1). 

Neutralization. Microneutralization and HA-inhibition assays were carried out 
according to standard protocols, as described in detail in Methods. 

In vivo protection in mice. Animal experiments were performed in accordance 
with the guidelines of the Mount Sinai School of Medicine and St Jude Children’s 
Research Hospital Institutional Animal Care and Use Committees (IACUC). All 
animal experiments were done essentially as described in ref. 8. Virus strains used 
in these studies were A/Memphis/3/2008 (H1N1) and A/X-31/1968 (H3N2). 
5-6-week-old BALB/c mice were used for the HIN1 studies and 8-week-old 
DBA/2 mice for the H3N2 studies. 

Proteins. Antibodies were expressed in Escherichia coli, baculovirus and 
mammalian cells as described in Methods. HAs were expressed in baculovirus 
as previously described in ref. 12 or purchased from Protein Sciences Corp. 
Binding data. Binding of C05 IgG or Fab to recombinant HAs and Kg deter- 
mination was performed by BLI, ITC and enzyme-linked immunosorbent assay. 
Structure determination. Robotic crystal screening was carried out using the 
Rigaku Crystalmation system at the Joint Center for Structural Genomics. 
Diffraction data were collected at the Advanced Photon Source GM/CA-CAT 
23ID-B and 23ID-D and SSRL 11-1 beamlines and processed using HKL2000 
(HKL Research) and XPREP (Bruker). Structures were solved by molecular 
replacement using Phaser, adjusted using Coot and refined in Phenix. Kabat 
numbering was applied using Abnum”’. Structural analyses were performed 
using programs MS and CONTACSYM. 

Sequences. PDB antibody sequences were analysed through the Abysis database. 
HA sequence data sets were downloaded from the NCBI Flu database™, aligned 
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with MUSCLE” and analysed using Genetics Computer Group (Accelrys) and 
custom shell scripts. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

HA protein and whole-virus enzyme-linked immunosorbent assay. 
Recombinant HA proteins: H5 protein, A/Vietnam/1203/2004 (Protein 
Sciences), 10 ng per well; recombinant HA H1 protein, A/ New Caledonia/20/ 
99 (Protein Sciences), 10 ng per well; recombinant HA H3 protein, A/Wisconsin/ 
67/05 (Protein Sciences), 10ng per well; HIN1 virus, A/New Caledonia 
(BioSource), 70 ng per well; H3N2 virus, A/Panama/2007/99 (BioSource), 10 ng 
per well. 

Enzyme-linked immunosorbent assay (ELISA)-inactivated virus was incu- 
bated overnight at room temperature (20-22 °C). The next day plates were 
blocked (1% bovine serum albumin in PBS buffer and 0.05% Tween-20) and 
0.1 ml serum samples (diluted in blocking buffer) were incubated, washed and 
detected using a peroxidase-conjugated anti-human Fe antibody (Jackson 
ImmunoResearch) and tetramethyl benzidine detection (BioFX). Attenuance at 
D450 nm Was read, and data recorded is reported herein. 

Library construction. Antibody phage-display libraries were created from the 
individual immune repertoires of confirmed influenza infection survivors. Bone 
marrow aspirates were collected and stored in RNAlater (Ambion) to preserve 
RNA integrity until processing for library construction. Serum samples were 
taken from each individual and used to confirm antibody reactivity to various 
HA proteins or inactivated influenza viruses by ELISA (Supplementary Fig. 13). 
The phage-display libraries were constructed as previously described in ref. 7. 
Selection of HA-binding clones. Phage antibody library panning was done 
essentially as described in ref. 7. Cross-reactive antibodies were identified by 
selection on both H3 and H1 HA protein antigens. After two rounds of panning 
on H3 HA A/Wisconsin/67/05, H1 HA protein from A/New Caledonia/20/1999 
replaced the H3 HA protein for two additional rounds. A fifth and final round of 
selection was performed on the H3 HA protein. 

IgG expression and Fabs. The C05 heavy- and light-chain regions were 
mammalian-codon-optimized, synthesized (GeneArt) and cloned into the 
pTT5-based expression vectors. Expression was done essentially as described in 
ref. 7. The C05 Fab fragment was obtained by Lys-C digestion of IgG, followed by 
purification by protein A (GE Healthcare), protein G (GE Healthcare), cation 
exchange (MonoS, GE Healthcare) and gel filtration (Superdex 200, GE 
Healthcare). For binding studies, C05 Fab was expressed recombinantly using 
the baculovirus system. A DNA fragment derived from the promoter region of 
pFastBac-Dual (Invitrogen) was fused to the gp67 and the honey bee melittin 
(HBM) signal peptides by overlap PCR, yielding a fragment with head-to-head 
p10 and polyhedrin promoters upstream of the HBM and gp67 signal peptides, 
respectively (that is, HBM-p10-pPolyH-gp67). C05 Fab heavy- and light-chain 
regions were synthesized (GenScript), fused to the promoter-signal peptide 
cassette by overlap PCR (heavy chain downstream of pPolyH-gp67 and light 
chain downstream of p10-HBM) and ligated into the Sfil sites of a derivative of 
pFastBacDual. A His, tag was introduced at the C terminus of the light chain to 
facilitate purification. 

Mutations to probe the importance of the CDR H1 insertion were generated by 
site-directed mutagenesis using the QuikChange Lightning Multi kit (Agilent). 
The resulting baculovirus transfer vectors were used to generate recombinant 
bacmids using the Bac-to-Bac system (Invitrogen) and virus was rescued by 
transfecting purified bacmid DNA into Sf9 cells using Cellfectin II 
(Invitrogen). C05 Fab proteins were produced by infecting suspension cultures 
of Hi5 cells (Invitrogen) with recombinant baculovirus at a multiplicity of infec- 
tion (MOI) of 5-10 and incubating at 28 °C, shaking at 110 r.p.m. After 72 h, the 
cultures were clarified by two rounds of centrifugation at 2,000g and 10,000g at 
4°C. The supernatant containing secreted, soluble Fab was concentrated and 
buffer exchanged into 1 PBS, pH 7.4. After metal-affinity chromatography 
using nickel-nitrilotriacetic acid (Ni-NTA) resin, Fabs were purified by protein 
G affinity chromatography (GE Healthcare) and gel filtration (Superdex 200). 
ELISA. Microtitre plates were coated with 0.1 ml of the following HA antigens 
diluted in coating buffer and incubated overnight at room temperature: 100 ng 
ml-' H5N1 Vietnam 1203/04, 700ngml' H1INI New Caledonia/20/99 and 
100 ng ml’ H3N2 Wisconsin/67/05. Blocking was done with 0.3 ml of blocking 
buffer (4% non-fat dry milk in PBS and 0.05% Tween-20). After blocking, 
antibodies were diluted to 0.5 1g ml’ in 2% non-fat dry milk blocking buffer 
and were incubated for 2h at 4°C, washed and later detected using a 1:3,000 
dilution of peroxidase-conjugated anti-human Fe antibody (Jackson 
ImmunoResearch) in 2% non-fat dry milk blocking buffer and standard TMB 
substrate detection. Attenuance at D4sonm was read, and data recorded are 
reported herein. 

Viruses. Recombinant influenza viruses were generated using reverse genetics as 
previously described in ref. 39. In brief, 1 ig each of ten plasmids was transfected 
into 293T cells in monolayer. Each transfection contained ambisense plasmids 
(for the expression of both viral RNAs and messenger RNAs) for the A/Puerto 
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Rico/8/34/PA, PB1, PB2, NP, M and NS segments, in addition to viral RNA 
(pPOLI type) and protein expression plasmids (pCAGGS type) for A/X-31/ 
1968 HA and NA (pCAGGS expression plasmid was kindly provided by J. 
Miyazaki"). Twenty hours after transfection, 293T cells were re-suspended in 
cell culture supernatant and used to inoculate 10-day-old embryonated eggs. 
Microneutralization. Antibodies were screened for neutralizing activity against 
viruses as follows. Twofold serial dilutions of each monoclonal antibody were 
incubated with 100 50% tissue culture infective doses of virus in PBS at 37 °C for 
1h. Madin-Darby canine kidney cell monolayers in 24-well plates were washed 
once with PBS and inoculated with virus-antibody mixtures. After incubation for 
1h at 37°C in 5% CO), the inoculum was removed and monolayers were again 
washed once with PBS. Opti- MEM supplemented with 0.3% BSA, 0.01% FBS and 
1 pg ml tosyl phenylalanyl chloromethyl ketone-treated trypsin was added and 
cells were incubated for 72 h at 37 °C. The presence of virus in cell culture super- 
natants was assessed by HA assays using 0.5% chicken red blood cells. 
Haemagglutination inhibition. Haemagglutination-inhibition titres were deter- 
mined using four haemagglutinating units of HA protein or virus. HA protein or 
virus was incubated for 1 h at room temperature with serial twofold dilutions of 
C05 IgG in 96-well, V-bottom microtitre plates (Costar) followed by addition of 
0.5% chicken red blood cells and mixing. The mixture was incubated for 30 min at 
room temperature followed by brief centrifugation at 1,000 r.p.m. to pellet the red 
blood cells. 

Generation of escape mutants. In brief, H3N2 A/X-31/1968 virus was incubated 
with a neutralizing excess of monoclonal antibody for 45 min at room temper- 
ature. The virus-monoclonal mix was inoculated into 10-day-old embryonated 
eggs and incubated at 37 °C for 48 h. Eggs were inoculated in triplicate. The virus 
(allantoic fluid) grown under these conditions was collected and tested for virus 
outgrowth by HA assay. No viral growth was observed by haemagglutination of 
chicken red blood cells. The experiment was performed twice. 

In vivo protection. Animal experiments were performed in accordance with the 
guidelines of the Mount Sinai School of Medicine and St Jude Children’s Research 
Hospital Institutional Animal Care and Use Committees (IACUC). All animal 
experiments were done essentially as described in ref. 8. Virus strains and titres 
used in these studies were HIN1 A/Memphis/3/2008 (25 MLDso, 1 X 10° 50% egg 
infective doses) and H3N2 A/X-31/1968 (33 MLDsp, 300 plaque-forming units). 
Mice were 5-6-week-old BALB/c for the H1N1 studies and DBA/2 for the H3N2 
studies. 

Cloning, expression and purification of influenza A HAs. On the basis of H3 
numbering, complementary DNAs corresponding to residues 11-329 (HA1) and 
1-176 (HA2) of the influenza A HA ectodomain were fused to an amino-terminal 
gp67 signal peptide and to a C-terminal biotinylation site (amino acid sequence: 
GGGLNDIFEAQKIEWHE), trimerization domain and His,-tag by overlap PCR, 
essentially as previously described in ref. 12. The trimerization domain and 
Hisg-tag were separated from the HA ectodomain by a thrombin cleavage site. 
In the case of the HK68/H3 trimer construct used for crystallization, the 
biotinylation site was omitted from the construct. For the HK68/HA1 subunit 
used for crystallization, HA1 residues 43-309 were fused to the gp67 signal 
peptide (N terminus) and a C-terminal His,-tag. The resulting PCR products 
were digested with Sfil and inserted into a custom baculovirus transfer vector 
(pDCE198). The production of virus and expression conditions were essentially 
as above for the C05 Fabs. After metal-affinity chromatography using Ni-NTA 
resin, trimeric HAs for crystallography were digested with trypsin (New England 
Biolabs, 5 mU trypsin per mg HA, 16h at 17 °C) to produce uniformly cleaved 
(HA1/HA2), and to remove the trimerization domain and Hisg-tag. After 
quenching the digests with 2 mM phenylmethylsulphony] fluoride, the digested 
material was further purified by anion-exchange chromatography (MonoQ, GE 
Healthcare, 10 mM Tris, pH 9.0, 50-1,000 mM NaCl) and size-exclusion chro- 
matography (Superdex 200, GE Healthcare, 10 mM Tris, pH 8.0, 150 mM NaCl). 
Expression and purification of BirA. Escherichia coli biotin ligase (BirA 
enzyme) was expressed and purified in a manner similar to previous reports*’, 
but with an N-terminal His tag, essentially as previously described in ref. 12. In 
brief, a pET21a derivative containing the E. coli birA gene (pDCE095) was trans- 
formed into BL21(DE3) cells, grown in shake flasks in low salt LB medium at 
37 °C to an attenuance (at Déoonm) of ~0.7, then shifted to 23 °C and induced 
with the addition of isopropyl-B-p-thiogalactopyranoside (IPTG) to a final con- 
centration of 1 mM. The culture was incubated at 23 °C for ~16 h after induction. 
The cells were lysed and homogenized by two passes through an EmulsiFlex C-3 
cell disruptor (15kPSI), and birA was purified by nickel affinity (NiNTA resin, 
Qiagen), anion exchange (MonoQ column, GE Healthcare) and gel filtration. 
Purified BirA protein was concentrated to 5 mg ml! in 50 mM Tris, pH 7.5, 
200 mM potassium chloride and 5% glycerol, aliquoted, flash frozen in liquid 
nitrogen and stored at —80°C. 
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Biotinylation and purification of influenza HAs for binding studies. After 
expression and Ni-NTA purification of influenza A HA as described above, proteins 
for binding studies were concentrated to ~2-5mgml ' total protein. The HAs 
were biotinylated by the addition of 25 ug BirA enzyme per mg total protein, in a 
buffer of the following composition: 100 mM Tris, pH 8.0, 10mM ATP, 10mM 
magnesium acetate and 50 uM biotin, with less than 50 mM NaCl. The biotinyla- 
tion reactions were incubated at 37 °C for 1-2 h. Biotinylated HAs were purified by 
size-exclusion chromatography and concentrated to ~5-20 mg ml '. 

Ka determination by BLI. Kg was determined by BLI using an Octet Red instru- 
ment (ForteBio). Biotinylated HAs, purified as described above, were used for 
these measurements. HAs at ~ 10-50 pig ml | in 1X kinetics buffer (1 X PBS, pH 
7A, 0.01% BSA and 0.002% Tween 20) were loaded onto streptavidin-coated 
biosensors and incubated with varying concentrations of C05 Fab in solution. 
All binding data were collected at 30°C. The experiments comprised five steps: 
(1) baseline acquisition (60 s); (2) HA loading onto sensor (300-600 s); (3) second 
baseline acquisition (180-200 s); (4) association of C05 for the measurement of 
kon (180 s); and (5) dissociation of C05 for the measurement of kg (180 s). Four to 
six concentrations of C05 were used, with the highest concentration varying, 
depending on the HA affinity, from 50 to 1,000 nM. Baseline and dissociation 
steps were carried out in buffer only. The ratio of k,, to ko determines the Ky 
reported here. The sequences of all proteins used in this work are available in 
FASTA format at the end of this document. All binding traces and curves used for 
fitting k,, and ko are reported in Supplementary Figs 14 and 15. 

Kg determination by ITC. ITC was performed using an ITC 99 calorimeter 
(Microcal). All proteins were dialysed extensively against PBS buffer before the 
titrations. The experiments were performed at 25 °C. A typical titration consisted 
of injecting 1.6 1] aliquots of C05 Fab (0.5 mM) into 70 1M HA solution in the cell 
at time intervals of 90 s to ensure that the titration peak returned to the baseline. 
ITC data were corrected for the heat of dilution by subtracting the mixing 
enthalpies for titrant solution injections into protein-free buffer. ITC data were 
analysed using program Origin (Version 7.0) as provided by the manufacturer 
using a single set of identical binding sites model. 

Owing to the high molecular weight of the system and the very poor expression 
of the A/Brisbane/ 10/2007 (H3N2) HA, we were unable to achieve a high enough 
molar concentration of HA in the ITC cell to obtain a complete, high-quality 
titration curve. For a Ky of 10 uM, the manufacturer’s guidelines suggest 0.5 mM 
HA in the cell and 5 mM Fab in the syringe. This would correspond to ~30 mg 
ml ' HA (~10 mg total) and ~250 mg ml! Fab (~20 mg total). However, we 
were only able to obtain ~1 mg of the A/Brisbane/10/2007 (H3N2) HA from 101 
of insect cell culture, precluding the possibility of obtaining the quantity of 
protein required for an optimal experiment. In addition, although C05 Fab could 
be obtained in good yields and could be concentrated to at least 50mg ml" * 
(~1 mM), clear self-association between C05 molecules was apparent even at 
~19mg ml! (~0.38 mM), the concentrations used in the experiment reported 
here. However, after subtraction of Fab injections into a buffer-filled cell to 
correct for the heat of dilution resulting from the Fab self-association at high 
concentrations, the resulting data could still be fit reasonably well and an approx- 
imate value for the Ky could be obtained (see Supplementary Fig. 16). 

Electron microscopy of C05 Fab with H1, H2 and H3 HAs. Negative-stained 
grids were prepared by applying 0.1 mg ml ' of the purified complex to a freshly 
glow-discharged carbon-coated 400 Cu mesh grid and stained with Nano-W. 
Grids were viewed using a FEI Tecnai TF20 electron microscope operating at 
an accelerating voltage of 120kV and imaged at a magnification of < 100,000. 
Images were acquired on a Gatan 4k X 4k charge-coupled device (CCD) camera 
in five-degree tilt increments from 0 to 55° using a de-focus range of 700 to 
900 nm. The tilt angles provided additional particle orientations to improve the 
image reconstruction. The pixel size of the CCD camera was calibrated at this 
magnification to be 1.09 A using a two-dimensional catalase crystal with known 
cell parameters. 

Image processing. For each complex, particles were automatically selected from 
micrographs using the DoG Picker software through the Appion package**”’. The 
contrast transfer function estimation for micrographs was completed using 
ctffind3 and ctfttilt**“*. Particles were binned by four (80 X 80-sized boxes) and 
reference free two-dimensional class averages were calculated using the Sparx 
package, without imposing symmetry“’. An unliganded 25A hemagglutinin (H1) 
volume generated with Sparx was used as the initial model for the image recon- 
struction process, with the reference-free class averages as the experimental 
images. Density for the Fabs could be visualized after 3 cycles of refinement. 
Refinement was continued for 70 cycles with decreasing angular search para- 
meters. The resultant model was then used as the initial model for the iterative 
image reconstruction process against the raw particles binned by two (160 X 160- 
sized boxes). Contrast transfer function correction was applied during this pro- 
cedure using the Sparx package. C3 symmetry was imposed throughout the image 


reconstruction process. The final three-dimensional image reconstructions have 
an estimated resolution of 17A according to a Fourier shell correlation (FSC) 
of 0.5. 

Model fitting. Coordinates of HA from A/Solomon Islands/3/2006 (H1N1) and 
A/Japan/305/1957 (H2N2) in addition to C05 were fitted into the respective 
electron microscopy reconstructions. For H3, the coordinates of HA from 
A/Hong Kong/1/1968 (H3N2) were used in the electron microscopy reconstruc- 
tion of HA from A/Perth/16/2009 (H3N2) with C05 Fab. The automated fit of 
each component was manually improved through rigid body movement using the 
program UCSF Chimera to minimize the number of clashes between HA and the 
Fabs while maintaining the highest possible correlation coefficient value with the 
chimera fit tool. This resulted in a portion at the base of the stem region of HA 
being excluded from the image reconstruction. Coincidentally, this region of HA 
possesses high B-factors in the crystal structures and may adopt distinct confor- 
mations on the electron microscope grid. The density for this region can be seen at 
lower contour thresholds. 

Crystallization and structure determination of the HK68/H3 HA. Gel- 
filtration fractions containing HK68/H3 HA were concentrated to ~9 mg ml 
in 10 mM Tris, pH 8.0, and 50 mM NaCl. Initial crystallization trials were set up 
using the automated Rigaku CrystalMation robotic system at the Joint Center for 
Structural Genomics (http://www,jcsg.org). The crystal used for data collection 
was grown by the sitting drop vapour diffusion method with a reservoir solution 
(200 ul) containing 2.0M ammonium sulphate, 100mM sodium cacodylate, 
pH6.5, and 200mM NaCl. Drops consisting of 100nl protein + 100 nl 
precipitant were set up at 20°C, and crystals appeared within 3-7 days. The 
resulting crystals were cryoprotected by brief immersion in well solution supple- 
mented with 35% glycerol, then flash-cooled and stored in liquid nitrogen until 
data collection. 

Diffraction data for the HK68/H3 HA were collected on beamline 11-1 at the 

SSRL. The data were indexed in space group C222,, integrated and scaled using 
HKL2000 (HKL Research). The structure was solved by molecular replacement to 
1.90 A resolution using Phaser*®. A protomer from 2VIU"* (A/X-31/1968 (H3N2) 
HA) was used as the initial search model and three copies (forming a non- 
crystallographic trimer) were found in the asymmetric unit. At this point, poorly 
defined regions of the model were trimmed and the HA was mutated in silico to 
match the HK68/H3 sequence. Rigid body refinement, simulated annealing and 
restrained refinement (including translation-libration-screw (TLS) refinement, 
with one group for each HA1 or HA2 domain) were carried out in Phenix’. 
Riding hydrogens were used during refinement and are included in the final 
model. Between rounds of refinement, the model was built and adjusted using 
Coot"®. Waters were built automatically using the ‘ordered_solvent’ modelling 
function in Phenix’’. Refinement statistics can be found in Supplementary 
Table 1. 
Crystallization and structure determination of the C05 Fab. Gel-filtration 
fractions containing the C05 Fab were concentrated to ~11 mg ml’ in 10mM 
Tris, pH 8.0, and 50 mM NaCl. Initial crystallization trials were set up using the 
automated Rigaku CrystalMation robotic system at the Joint Center for Structural 
Genomics (http://www.jcsg.org). Several hits were obtained, with the most 
promising candidates grown in ~2.0-2.5 M ammonium sulphate between pH 5 
and 7. Optimization of these conditions resulted in diffraction-quality crystals. 
The crystals used for data collection were grown by the sitting drop vapour 
diffusion method with a reservoir solution (1 ml) containing 1.8 M ammonium 
sulphate, 100 mM sodium acetate, pH 5.4, and 50 mM NaCl. Drops consisting of 
0.5 pl protein + 0.5 pl precipitant were set up at 4 °C and crystals appeared within 
3-7 days. The resulting crystals were cryoprotected by soaking in well solution 
supplemented with increasing concentrations of ethylene glycol (5% steps, 1 h per 
step) to a final concentration of 20%, then flash-cooled and stored in liquid 
nitrogen until data collection. 

Diffraction data for the C05 Fab crystals were collected on the GM/CA-CAT 
23ID-D beamline at the Advanced Photo Source (APS) at Argonne National 
Laboratory. The data were indexed in space group P4,2,2, integrated, scaled, 
and merged using HKL2000 (HKL Research). The structure was solved by 
molecular replacement to 2.30 A resolution using Phaser’. PDB codes 1AQK 
and 1VGE were used as search models for the variable and constant domains, 
respectively, and 2 complete Fabs were found in the asymmetric unit. Rigid body 
refinement, simulated annealing and restrained refinement (including TLS 
refinement, with one group for each Ig domain) were carried out in Phenix”. 
Riding hydrogens were used during refinement and are included in the final 
model. Between rounds of refinement, the model was built and adjusted using 
Coot"®. Waters were built automatically using the ‘ordered_solvent’ modelling 
function in Phenix’’. Refinement statistics can be found in Supplementary 
Table 1. 
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Crystallization and structure determination of the C05-HK68-H3 HA1 
complex. Gel-filtration fractions containing the C05-HK68/H3 HA1 complex 
were concentrated to ~9 mg ml! in 10mM Tris, pH 8.0, and 50mM NaCl. 
Initial crystallization trials were set up using the automated Rigaku 
CrystalMation robotic system at the Joint Center for Structural Genomics 
(http://www.jcsg.org). The crystals used for data collection were grown by the 
sitting drop vapour diffusion method with a reservoir solution (200 pl) contain- 
ing 10% (w/v) polyethylene glycol (PEG) 3000, 200 mM zinc acetate and 100 mM 
sodium acetate, pH 4.5. Drops consisting of 100 nl protein + 100 nl precipitant 
were set up at 4 °C, and crystals appeared within 3 days and continued to grow 
larger until approximately day 7. The resulting crystals were cryoprotected by 
brief immersion in well solution supplemented with 30% ethylene glycol, then 
flash-cooled and stored in liquid nitrogen until data collection. 

Diffraction data for the C05-HK68/H3 HA1 complex were collected on the 

GM/CA-CAT 23ID-D beamline at the APS at Argonne National Laboratory. The 
data were indexed in space group P2), integrated, scaled and merged using 
HKL2000 (HKL Research). The structure was solved by molecular replacement 
to 2.95 A resolution using Phaser“®. The isolated HA1 subunit from the HK68/H3 
structure reported here was used as a search model and four copies were found in 
the asymmetric unit. A subsequent search with the variable and constant domains 
from the un-liganded C05 Fab structure yielded good solutions for four complete 
Fab fragments. Rigid body refinement, simulated annealing and restrained refine- 
ment (including TLS refinement, with one group for HAI and one for each 
immunoglobulin domain) were carried out in Phenix’. Riding hydrogens were 
used during refinement and are included in the final model. Between rounds of 
refinement, the model was built and adjusted using Coot**. Refinement statistics 
can be found in Supplementary Table 1. 
Crystallization and structure determination of the CO5-HK68-H3 HA trimer 
complex. Gel-filtration fractions containing the CO5-HK68-H3 trimeric HA 
complex were concentrated to ~10mg ml! in 10mM Tris, pH 8.0, and 
50mM NaCl. Initial crystallization trials were set up using the automated 
Rigaku CrystalMation robotic system at the Joint Center for Structural 
Genomics (http://www.jcsg.org). Only two hits were obtained (in related condi- 
tions) and neither could be reproduced or optimized further. Thus, the crystals 
from the initial screen were used for data collection. The best crystal was grown by 
the sitting drop vapour diffusion method with a reservoir solution (200 ll) con- 
taining 28% (w/v) PEG 400, 200mM calcium chloride and 100 mM HEPES 
buffer, pH 7.5. Drops consisting of 100 nl protein + 100 nl precipitant were set 
up at 4°C and a very small crystal was visible after 14 days. Over the course of a 
further 2 weeks, the crystal reached it maximum size. The crystal was cryopro- 
tected by brief immersion in well solution supplemented with 10% ethylene 
glycol, then flash-cooled and stored in liquid nitrogen until data collection. 

Diffraction data for the C05-HK68/H3 trimer complex were collected on the 
GM/CA-CAT 23ID-B beamline at the APS at Argonne National Laboratory. 
Owing to poor crystal quality (cluster of stacked plates), the diffraction images 
were noisy and highly anisotropic. The data were indexed in space group PI, 
integrated and scaled using HKL2000 (HKL Research) and merged using XPREP 
(Bruker). The structure was solved by molecular replacement to a nominal reso- 
lution of 4.25 A using Phaser**. However, owing to the anisotropy of the diffrac- 
tion, many reflections were observed at a higher resolution (~3.2 A) than the 
nominal resolution reported, and these additional data were included in refine- 
ment. A HK68-H3 trimer from the high-resolution structure reported here was 
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used as the initial search model and total of four trimers (12 HA protomers) were 
found in the asymmetric unit. The CO5-HK68-H3 HA1 complex was docked 
onto each of the 12 HA1 subunits in the asymmetric unit to position the C05 Fab. 
Rigid body refinement, simulated annealing and restrained refinement (including 
TLS refinement, with one group for HA1, one for HA2 and one for each immuno- 
globulin domain) were carried out in Phenix’’. Riding hydrogens were used 
during refinement and are included in the final model. Between rounds of refine- 
ment, the model was built and adjusted using Coot*. Refinement statistics can be 
found in Supplementary Table 1. 

Structural analyses. Hydrogen bonds and van der Waals contacts between C05 
and the HK68/H3 HA1 domain HA were calculated using HBPLUS and 
CONTACSYM, respectively. Surface area buried on Fab binding was calculated 
with MS. MacPyMol (DeLano Scientific) and UCSF Chimera” was used to render 
structure figures and for general manipulations. Kabat numbering was applied to 
the coordinates using the AbNum server”. The final coordinates were validated 
using the JCSG quality-control server (v2.7), which includes MolProbity”’. 
Sequence analysis of C05 epitope conservation in all HA proteins. All full- 
length, non-redundant and non-lab-strain influenza A HA sequences were 
downloaded from the NCBI Flu database*’. At the time of download (7 August, 
2010), the data set included 8,720 sequences encompassing all 16 influenza A 
subtypes, of which 5,813 and 2,907 sequences were from group 1 and group 2 
viruses, respectively. The sequences were aligned using MUSCLE” and analysed 
using Genetics Computer Group (Accelrys) and custom shell scripts (available 
from the authors on request). In this analysis, residues are considered conserved if 
substitutions are restricted to other amino acids in the same group: (1) Asp, Asn, 
Glu, Gln (negatively charged or isosteric polar substitutions); (2) Val, Ile, Leu, 
Met (small hydrophobic); (3) Phe and Tyr (aromatic); and (4) Ser and Thr (small 
polar/hydroxyl). The values reported for percent conservation are the number to 
sequences with an identical or conservative change at a position divided by the 
total number of sequences and multiplied by 100. 
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No surviving evolved companions of the progenitor 


of SN 1006 


Jonay I. Gonzalez Hernandez!?, Pilar Ruiz-Lapuente*’, Hugo M. Tabernero”, David Montes”, Ramon Canal‘, Javier Méndez*® 


& Luigi R. Bedin’ 


Typela supernovae are thought to occur when a white dwarf 
made of carbon and oxygen accretes sufficient mass to trigger a 
thermonuclear explosion’. The accretion could be slow, from an 
unevolved (main-sequence) or evolved (subgiant or giant) star”? 
(the single-degenerate channel), or rapid, as the primary star 
breaks up a smaller orbiting white dwarf** (the double-degenerate 
channel). A companion star will survive the explosion only in the 
single-degenerate channel’. Both channels might contribute to the 
production of type Ia supernovae®’, but the relative proportions of 
their contributions remain a fundamental puzzle in astronomy. 
Previous searches for remnant companions have revealed one 
possible case for SN 1572 (refs 8, 9), although that has been 
questioned”. More recently, observations have restricted surviving 
companions to be small, main-sequence stars'’’, ruling out giant 
companions but still allowing the single-degenerate channel. Here 
we report the results of a search for surviving companions of the 
progenitor of SN 1006 (ref. 14). None of the stars within 4 arc 
minutes of the apparent site of the explosion is associated with 
the supernova remnant, and we can firmly exclude all giant and 
subgiant stars from being companions of the progenitor. In 
combination with previous results, our findings indicate that fewer 
than 20 per cent of type Ia supernovae occur through the single- 
degenerate channel. 

Together with SN1572 (Tycho Brahe’s supernova), SN 1604 
(Kepler’s supernova) and the recently identified SN 185, SN 1006 is 
one of only four known historical Galactic type Ia supernova events. It 
is also the only one whose type has never been disputed. Whereas a 
survey of the stars close to the centre of Tycho’s supernova remnant 
(SNR) produced a likely candidate for the companion*”, and the 
supernova may thus be attributed to accretion via the single- 
degenerate channel, the absence of a surviving companion in the 
supernova remnant SNR0509-67.5, in the Large Magellanic Cloud, 
down to very faint magnitudes, strongly suggests that the supernova 
there was the result of accretion via the double-degenerate channel’. 
Although the aforementioned direct searches have excluded, up to 
now, red giant companions, there is some evidence from nearby spiral 
galaxies that a fraction of type Ia supernovae may have had companions 
of this type’®. This hypothesis, however, is challenged by the absence of 
ultraviolet emission, which would be expected at the beginning of the 
supernova outburst’”. 

The distance to the remnant of SN 1006 (2.18 + 0.08 kpc away, as 
determined from the expansion velocity and the proper motion of the 
ejecta'*) is much more precisely known than that of the remnant of 
SN 1572 (2.83 0.70 kpc). The interstellar extinction is also much 
smaller, which makes the distances to the stars less uncertain. To be 
a possible candidate, a star must be at the correct distance and, depend- 
ing on its spectral type and luminosity class, must show some spectral 
peculiarity or an enhancement of the abundances of Fe-peak elements 


such as that seen in star G of Tycho’s supernova’. Because of its relative 
proximity compared with SN 1572, and the lower extinction (V-band 
extinction of Ay = 0.3 mag, versus 2.0-2.4 mag in Tycho’s supernova), 
going down to an R-band magnitude of mp = 15 mag allows us to 
include, out to the supernova distance and beyond, all red giants, 
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Figure 1 | R-band image of the SN 1006 field. Image from the Digitized Sky 
Survey 2. The positions and names of the 26 stars included in the spectroscopic 
survey (selected from the USNO-A2.0 catalogue”) are given. The centre of our 
search is the geometrical centre of the quite symmetrical X-ray emission of the 
SNR” (green cross), at RA = 15h 2 min 55s, dec. = —41° 55’ 12'’. Also shown 
are the boundary of the surveyed region (large green circle) and the geometrical 
centre of the Hx emission’* (small yellow circle). Giant stars are marked by red 
squares. See also Supplementary Fig. 1, for a full view of the SNR. Another 
centre has been proposed more recently”, on the basis of the distribution of the 
ejecta along the line of sight. It is, however, still located within our surveyed 
area. For a star at a distance of d ~ 2.2 kpc and with a velocity of ~100kms * 
(roughly the orbital velocity before the explosion) perpendicular to the line of 
sight, the angular displacement on the sky in 1,000 yr would be only 10’’. 
However, given the asymmetry of the SNR, and also that in core-collapse 
supernovae the distance between the compact object and the X-ray centroid of 
the SNR can be 15% or more of the radius of the SNR, we adopted a much wider 
radius for the search: 4’. That amounts to 27% of the radius of the SNR, which is 
15’ (Supplementary Fig. 1). Positions, magnitudes and angular distances to the 
centre of the survey are given in Supplementary Table 1. 
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Table 1 | Parameters of the sample stars 


Name Tort (K) logig(cms °)] Viurb (km s~*) [Fe/H] (dex) Vraa (kms +) d (kpc) 

BO9472 5853 +75 4.52+0.17 1.211 +0.105 —0.04 + 0.06 —20.74+2.14 1.23 + 0.53 
B11408 4677 +68 2.91 +0.23 2.136 + 0.082 —0.29 + 0.05 —123.54+1.48 1.62 + 0.68 
B05723 5910+87 4.50 + 0.23 1.249 + 0.107 0.30 + 0.07 —12.81 + 3.06 0.76 + 0.32 
B17720 5300 + 86 4.65 + 0.22 1.190 + 0.193 0.33 + 0.06 —60.13 + 1.30 0.58 + 0.25 
B16564 4845 + 39 3.13 + 0.16 1.607 + 0.049 —0.37 + 0.03 —116.07 + 1.86 3.03 + 1.27 
B97338 5707 +42 4.21 +0.12 1.292 + 0.054 0.01 + 0.04 —5.53 + 2.25 1.05 + 0.45 
B05518 6327 + 63 4.85 +0.13 1.386 + 0.090 0.14 + 0.04 —4.38+ 1.92 0.50 + 0.21 
B20292 5196+ 33 3.67 + 0.09 1.372 + 0.051 —0.66 + 0.03 2.92 + 2.52 1.13 + 0.48 
B14130 6177+44 4.61 + 0.09 1.528 + 0.052 0.31 + 0.03 —21.26 + 2.09 0.89 + 0.38 
B03395 5517+49 4.36+0.13 1.165 + 0.074 0.22 + 0.04 —56.18+5.21 0.95+0.41 
B97341 4881+51 2.98 + 0.18 1.641 + 0.057 —0.16 + 0.04 —41.76+1.96 2.48 + 1.04 
B99810 4658 + 39 2.51+0.15 1.782 + 0.039 —0.72 + 0.03 33.53 + 1.50 2.42+1.02 
B15360 4960 + 70 3.40+0.21 1.886 + 0.086 0.07 + 0.05 —37.03 + 1.61 1.29 + 0.54 
B93571 4579 +58 2.51 +0.22 2.182 + 0.066 —0.21+0.05 —100.33 + 1.58 2.39+1.01 
B18024 6083 + 58 444+0.13 1.545 + 0.095 —0.36 + 0.04 22.61 + 2.35 0.60 + 0.26 
B08277 5693 + 34 4.39 + 0.09 1.123 + 0.044 0.17 + 0.03 —49.50 + 2.08 0.45 +0.19 
B24215 5729 +45 444+0.12 1.189 + 0.058 0.16 + 0.04 3.02 + 2.39 0.95+0.41 
B14707 5065 + 47 3.36+0.15 1.270 + 0.060 0.16 + 0.04 22.14 + 1.83 1.37 + 0.58 
B90474 5051 +38 3.05 +0.12 1.874 + 0.046 —0.29 + 0.03 —98.10+ 1.95 4.78 + 2.00 
B90102 5650 + 33 4.21 +0.10 1.076 + 0.049 —0.11 + 0.03 85.57 + 1.16 0.91 +0.39 
B10074 5601 + 63 4.52+0.14 1.036 + 0.102 0.05 + 0.05 —30.26 + 1.38 0.71 + 0.30 
B95979 6776+94 4.19 +0.16 1.832 + 0.131 0.02 + 0.06 —32.94+6.81 0.94 + 0.40 
B09749 6163 +65 4.53 + 0.14 1.362 + 0.082 0.14 + 0.05 —13.15+2.60 0.40 + 0.17 
B26090 6101 +39 4.58 + 0.09 1.238 + 0.053 0.15 + 0.03 —48.08 + 2.52 0.37 + 0.16 


We provide the stellar atmosphere parameters (effective temperature, Ter; surface gravity, log(g); microturbulent velocity, Vrurb), metallicities ([Fe/H]), radial velocities (V;aq) and distances (d) of the sample stars, 
together with 1o uncertainties. The stellar parameters and metallicities were obtained from Fe! and Fei excitation and ionization equilibria, using the StePar®° code, which makes use of the MOOG? code and 


ATLAS9 model atmospheres?’ Our analysis has been performed assuming local thermodynamic equilibrium. SN 1006 is located about 500 pc above the Galactic plane. Unlike the case of SN 1572, which lies 
close to the Galactic plane, the radial velocities of the stars along the line of sight do not follow a regular pattern (this can be seen in the sixth column), because most of them belong either to the thick disc or to the 
halo of the Galaxy (see the dispersion in metallicities in the fifth column). Two of the observed stars, B98824 and B21185, are spectroscopic binaries and were discarded as possible companion stars of the 


progenitor of SN 1006 (Supplementary Information). 


all subgiants and also main-sequence stars down to an absolute mag- 
nitude of My ~ +3.1 mag. Inspecting the Two Micron All Sky Survey”” 
photometric catalogue for possible unevolved companions (Supplemen- 
tary Information), we found no main-sequence stars brighter that 
May ~ 16.4 mag, which brings the limit down to Mg ~ +4.5 mag, cor- 
responding to a V-band absolute magnitude of My ~ +4.9 (approxi- 
mately equal to, or slightly less than, solar luminosity). Slightly evolved 
companions could be somewhat brighter: mpg ~ 16.0 (about twice as 
luminous as the Sun). Only the case of SNR0509-67.5 in the Large 
Magellanic Cloud”* has a fainter limit and has been reached in a direct 
search. 

We derive the stellar atmosphere parameters of the sample stars 
(Fig. 1) using high-resolution spectroscopic data from the Ultraviolet 
and Visual Echelle Spectrograph (UVES) (Table 1). The spectra of four 
giants in the sample are shown in Fig. 2, and those of F- and G-type 
dwarfs are shown in Supplementary Fig. 2. In Table 1, we also provide 
the radial velocities of sample stars measured from the UVES spectra, 
and distances determined from five photometric magnitudes, taking 
into account the stellar parameters (Supplementary Information). In 
Fig. 3, we compare the abundances of Fe-peak elements in the stars of 
our sample with the Galactic trends”, for F-, G- and K-type unevolved 
stars. Unlike in SN 1572, where star G shows an overabundance of Ni 
([Ni/Fe] = log(Nni/Nre) — log(Nni/Npe)o = 0.16 + 0.04, where Nyx is 
the number of atoms of element X and the second logarithm involves 
solar values), the stars in SN 1006 are all within the dispersion of the 
Galactic [Ni/Fe] trend. No enhancement is seen for any other element. 
In Supplementary Fig. 3, we also show the Galactic trends of several 
a-elements and we do not see any clear anomaly in these element 
abundances either. 

None of the stars in the sample shows any noteworthy rotation. 
High rotation speeds have been claimed to be a characteristic of the 
surviving companions of typeIa supernovae’’, on the assumptions 
that, first, owing to tidal interaction and in spite of the angular 
momentum loss due to mass transfer, the rotation periods before 
explosion are equal to the (short) orbital periods, and, second, the 
radius of the star remains basically unchanged after the explosion’. 
It has recently been shown’, however, that in a type Ia supernova the 
impact of the ejecta on the companion does indeed reduce those speeds 
by a large factor, which would make the rotation criterion irrelevant. 


534 | NATURE | VOL 489 | 27 SEPTEMBER 2012 


Only four stars are at distances (marginally) compatible with that of 
the remnant of SN 1006. All of them are red giants: B16564 (G9-KO 
II), B97341 (G9 III), B99810 (K1 III) and B93571 (K1 II) and none 
shows any spectroscopic peculiarity. 

Two-dimensional hydrodynamic simulations of the impact of the 
ejecta of a type Ia supernova on a red giant binary companion have 
been performed”. More recently, the emission of supernova debris, 
arising from their impact with a similar companion, has been also 
considered”. In terms of the effects of the explosion on the companion 
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Figure 2 | Observed UVES spectra of the candidate giant stars. The spectra 
are labelled by their names in Table 1 and sorted in order of decreasing effective 
temperature from top to bottom. These high-resolution spectra were obtained 
at the 8.2-m Kueyen VLT (UT2) telescope equipped with UVES at the Cerro 
Paranal Observatory, Chile. They were obtained on 13, 14, 25 and 28-30 April 
2002 and on 1 May 2002, and cover the spectral regions 3,295-5,595 A, 5,680- 
6,645 A, 6,705-8,515 A and 8,675-10,420 A at a resolving power of 

2/Ad ~ 43,000. The signal-to-noise ratio of the spectra is in the range 50-170 
and is typically 80. The data were reduced in a standard manner, and later 
normalized with the software IRAF, using low-order polynomial fits to the 
observed continuum. 
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Figure 3 | Stellar abundance ratios [X/Fe] of several Fe-peak elements. The 
chemical abundances have been derived using the equivalent-width technique 
for each element: chromium (a), manganese (b), cobalt (c) and nickel (d). We 
performed a differential analysis on a line-by-line basis, using the solar UVES 
spectrum of the Moon as reference (see Supplementary Information, section 3, 
for more details). The abundances of several Fe-peak elements relative to iron 


star, the results agree: most of the envelope is stripped away and what is 
left are the degenerate core and a very small fraction (amounting to a 
few per cent) of the original envelope. This hydrogen-rich envelope 
initially extends out to a radius of ~350 solar radii and then contracts 
on a thermal timescale. The star should be evolving at constant 
luminosity (ofabout 1,000 times the solar luminosity) towards increas- 
ing effective temperatures for 10°-10° yr. The same conclusion is 
reached when extrapolating to the red giant case the results of simula- 
tions of the impact of typelIa supernova ejecta on a main-sequence 
companion: the red giant should have been stripped of most of its 
hydrogen envelope. 

Nothing similar to any of the above four normal red giants would be 
seen. However, the peculiar type of object that would result from the 
interaction of the ejecta with a red giant companion would be luminous 
enough to have been seen at the distance of SN 1006 and within our 
magnitude limit. A subgiant star similar to star G in Tycho’s supernova 
would equally have been seen, but the only three subgiants in the 
sample (B20292, B15360 and B14707) lie much closer than the SNR. 
We are thus left with only main-sequence stars, which lie at shorter 
distances than the SNR; have luminosities similar to, or lower than, the 
solar luminosity; and are not predicted by any of the hydrodynamic 
simulations of the impact of typeIa supernova ejecta with either a 
main-sequence star or a subgiant. In these hydrodynamic simula- 
tions”, a main-sequence companion of one solar mass is expanded 
and heated, reaching ~5,000 times the solar luminosity; it is then 
predicted to contract and cool down on a thermal timescale. It has 
been found that the return to luminosities of the order of those before 
the explosion could be faster because of the short cooling times of the 
outermost layers of the star’, but, even so, in only ~1,000 yr the object 
does not have enough time to become dimmer than the Sun 
(Supplementary Information). 

Accordingly, SN 1006, the brightest event so far observed in our 
Galaxy, should have been produced either by mass accretion from 
an unevolved star similar to, or less massive than, the Sun (with the 
above caveats), or by merging with another white dwarf. Adding this 
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[Mn/Fe] 


[Ni/Fe] 


[Fe/H] 


(Supplementary Table 4b) are compared with the Galactic trends of these 
elements (black circles) in the relevant range of metallicities”. Red triangles 
correspond to the four giant stars whose distances are marginally compatible 
with that of the remnant of SN 1006. Blue squares correspond to the rest of the 
stars in the sample. The error bars are the lo uncertainties associated with the 
dispersion of the measurements from different spectral features. 


result to the evidence from the other direct searches, the single-degen- 
erate channel seems either to encompass only a clear minority of cases 
(20% or fewer) or preferentially involves main-sequence companions 
with masses more probably below that of the Sun. 
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Popularity versus similarity in growing networks 


Fragkiskos Papadopoulos!, Maksim Kitsak”, M. Angeles Serrano*, Marian Bogufia® & Dmitri Krioukov’ 


The principle’ that ‘popularity is attractive’ underlies preferential 
attachment’, which is a common explanation for the emergence of 
scaling in growing networks. If new connections are made pref- 
erentially to more popular nodes, then the resulting distribution of 
the number of connections possessed by nodes follows power 
laws**, as observed in many real networks**. Preferential attach- 
ment has been directly validated for some real networks (including 
the Internet”*), and can be a consequence of different underlying 
processes based on node fitness, ranking, optimization, random 
walks or duplication’"**. Here we show that popularity is just one 
dimension of attractiveness; another dimension is similarity'”™*. 
We develop a framework in which new connections optimize certain 
trade-offs between popularity and similarity, instead of simply pre- 
ferring popular nodes. The framework has a geometric interpretation 
in which popularity preference emerges from local optimization. As 
opposed to preferential attachment, our optimization framework 
accurately describes the large-scale evolution of technological (the 
Internet), social (trust relationships between people) and biological 
(Escherichia coli metabolic) networks, predicting the probability of 
new links with high precision. The framework that we have developed 
can thus be used for predicting new links in evolving networks, and 
provides a different perspective on preferential attachment as an 
emergent phenomenon. 

Nodes that are similar have a higher chance of getting connected, 
even if they are not popular. This effect is known as homophily in social 
sciences’”"*, and it has been observed in many real networks’?**. In the 
web”, for example, an individual creating her new homepage tends 
to link it not only to popular sites such as Google or Facebook, but also 
to not-so-popular sites that are close to her special interests—for 
example, sites devoted to the composer Tartini or to free solo climbing. 
These observations suggest the introduction of a measure of attractive- 
ness that would somehow balance popularity and similarity. 

The simplest proxy for popularity is the node birth time. All other 
things being equal, older nodes have more chances to become popular 
and attract connections**. If nodes join the network one by one, then 
the node birth time is simply the node number t = 1, 2, .... To model 
similarity, we randomly place nodes on a circle that represents the 
simplest similarity space. That is, the angular distances between nodes 
model their similarity distances, such as the cosine similarity or any 
other measure” *. The simplest way to model a balance between popu- 
larity and similarity is then to establish new connections that optimize 
the product between popularity and similarity. In other words, the 
model is simply as follows: (1) initially the network is empty; (2) at 
time t= 1, new node t appears at a random angular position 0, on the 
circle; and (3) new node t connects to a subset of existing nodes s, s < t, 
consisting of the m nodes with the m smallest values of product s0,,, 
where m is a parameter controlling the average node degree k= 2m, and 
0,, is the angular distance between nodes s and t (Fig. 1a, b). At early 
times t = m, node t connects to all the existing nodes. 

This model has an interesting geometric interpretation, shown in 
Fig. 1c. Specifically, after mapping birth time ¢ of a node to its radial 
coordinate 7; via r; = In ¢, all nodes lie not on a circle but on a plane— 
their polar coordinates are (1; 0;). It then turns out that new nodes 


connect simply to the closest m nodes on the plane, except that 
distances are not Euclidean but hyperbolic”. The hyperbolic distance 
between two nodes at polar coordinates (r,, 0,) and (r, 0,) is approxi- 
mately x= 1, + 7; + In(0,/2) = In(st0,,/2). Therefore the sets of 
nodes s minimizing x,, or s0,, for each t are identical. The hyperbolic 


Figure 1 | Geometric interpretation of popularity x similarity 
optimization. The nodes (dots) are numbered by their birth times, and located 
at random angular (similarity) coordinates. On its birth, the new circled node t 
in the yellow annulus connects to m old nodes s minimizing s0,,. The new 
connections are shown by the thicker blue links. In a and b, tf = 3 and m = 1. In 
a, node 3 connects to node 2 because 2033 = 21/3 < 10,3 = 51/6. In b, node 3 
connects to node 1 because 10)3 = 27/3 < 2033 = n. In ¢, an optimization- 
driven network with m = 3 is simulated for up to 20 nodes. The radial 
(popularity) coordinate of new node t = 20 is r, = In t, shown by the long thick 
arrow. This node connects to the three hyperbolically closest nodes. The red 
shape marks the set of points located at hyperbolic distances less than r, from 
the new node. Arrows on dots show all nodes drifting away from the crossed 
origin, emulating popularity fading as explained in the text. The drift speed in 
the network shown corresponds to the degree distribution exponent y = 2.1. 
The outer green circle shows the current network boundary of radius r, = In t 
expanding with time t as indicated by green arrows. 
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distance is then nothing but a convenient single-metric representation 
of a combination of the two attractiveness attributes, radial popularity 
and angular similarity. We will use this metric extensively below. 

The networks grown as described may seem to have nothing in 
common with preferential attachment (PA)?*. Yet we show in 
Fig. 2a that the probability /7(k) that an existing node of degree k 
attracts a connection from a new node is the same linear function of 
k in the described model and in PA. It is not surprising then that the 
degree distributions in PA and in our model are the same power laws. 
In Supplementary Information section IV, we prove that the exponent 
y of this power law approaches 2. Preferential attachment thus emerges 
as a process originating from optimization trade-offs between popu- 
larity and similarity. 

However, there are crucial differences between such optimization 
and PA. In the latter, new nodes connect with the same probability 
II(k) to any nodes of degree k in the network. In the former, new nodes 
connect only to specific subsets of such k-degree nodes that are closest 
to the new node along the similarity dimension 0 (Fig. 1c). To quantify, 
we compare in Fig. 2b the probability of connection between a pair of 
nodes as a function of their hyperbolic distance in the two cases. We see 
that close nodes are almost always connected in the optimization 
model, whereas in PA the probability of their connection is lower by 
an order of magnitude. On the other hand, nodes that are far apart are 
never connected in the optimization model, whereas they can be con- 
nected in PA. These differences manifest themselves in the strength of 
clustering, which is the probability that two nodes connected to the 
same node are also connected to each other. In PA, clustering is 
asymptotically zero*®, whereas it is strong in many real networks*®. 
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Figure 2 | Emergence of PA from popularity x similarity optimization. 
Two growing networks have been simulated up to t = 10° nodes, one growing 
according to the described optimization model, and the other according to PA. 
In both networks, each new node connects to m = 2 existing nodes. The y > 2 
limit is not well-defined in PA, so that y = 2.1 is used instead as described in the 
text.a, The probability /7(k) that an existing node of degree k attracts a new link. 
The solid line is the theoretical prediction, while the dashed line is a linear 
function, [/(k) «x k. b, The probability p(x) that a pair of nodes located at 
hyperbolic distance x are connected. The average clustering (over all nodes) in 
the optimization and PA networks is c= 0.83 and c=0.12, respectively. 
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We show in Supplementary Information section IV that the described 
optimization model leads to clustering that is the strongest possible for 
networks with a given degree distribution. 

The strength of clustering and the power-law exponent can both be 
adjusted to arbitrary values via the following model modifications. We 
first consider the effect of popularity fading, observed in many real 
networks”””*, We note that the closer the node to the centre in Fig. 1c, 
the more popular it is: the higher its degree, and the more new con- 
nections it attracts, which explains why PA emerges in the model. 
Therefore to model popularity fading, we let all nodes drift away from 
the centre such that the radial coordinate of node s at time f>s is 
increasing as 7,(f) = fr, + (1 — B)r; where r, = In s and r, = In t, and 
parameter fh € [0, 1]. This modification is identical to minimizing 
s°0., (or om with £ = b/a) instead of s0,,. It changes the power-law 
exponent to y=1+ 1/fh=2. If f=1, the nodes do not move and 
y = 2. If 6 = 0, all nodes move with the maximum speed, always lying 
on the circle of radius r,, while the network degenerates to a random 
geometric graph growing on the circle. PA emerges at any y = 1 + 1/6 
since the attraction probability //(k) is a linear function of degree k, 
II(k) x k + m(y — 2), the same as in PA*. We prove these statements 
in Supplementary Information sections IV-VII, where we also show 
that the popular fitness model'® can be mapped to our geometric 
optimization framework by letting different nodes drift away with 
different speeds (Supplementary Information section V). 

Because the strongest clustering is due to connections to the closest 
nodes, to weaken clustering we allow connections to more-distant 
nodes. Connecting to the m closest nodes is approximately the same 
as connecting to nodes lying within distance R, ~ r; (see Fig. 1c and 
Supplementary Information section IV, where we derive the exact 
expression for R,, which controls the average degree in the network). 
If new nodes t establish connections to existing nodes s at hyperbolic 
distance x,, with probability p(x,,) = 1/{1 + exp[(x,, — R,)/T]}, where 
parameter T=O is the network temperature (see Supplementary 
Information sections IV and VI), then clustering is a decreasing func- 
tion of temperature. That is, temperature is the parameter controlling 
clustering in the network. At zero temperature, the connection 
probability p(x,;) is either 1 or 0 depending on whether distance x, 
is less or greater than R;, so that we recover the strongest clustering case 
above, where new nodes connect only to the closest existing nodes. 
Clustering gradually decreases to zero at T=1, and remains 
asymptotically zero for any T=1 (Supplementary Information 
sections IV, VI). At high temperatures T' — ™, the model degenerates 
either to growing random graphs, or, remarkably, to standard PA 
(Supplementary Information section VII). 

To investigate if similarity shapes the structure and dynamics of real 
networks as our model predicts, we consider a series of historical snap- 
shots of the Internet, the E. coli metabolic network, and the social 
network of trust relationships between people, also known as the 
web of trust (WoT). The first two networks are disassortative (nodes 
of dissimilar degrees are connected with a higher probability), while 
the third is assortative (nodes of similar degrees are connected with a 
higher probability), and its degree distribution deviates from a power 
law. We map these networks to their popularity similarity spaces 
(Methods Summary). The mapping infers the radial (popularity) and 
angular (similarity) coordinates for all nodes, so that we can compute 
the hyperbolic distances between all node pairs, and the probabilities of 
new connections as functions of the hyperbolic distance between 
corresponding nodes. These probabilities are shown in Fig. 3. In all 
the three networks, they are close to the theoretical predictions of our 
model. 

This finding is important for several reasons. First, it shows that real- 
world networks evolve as our framework predicts. Specifically, given 
the popularity and similarity coordinates of two nodes, they link with 
probability close to the theoretical value predicted by the model. The 
framework may thus be used for link prediction, a notoriously difficult 
and important problem in many disciplines”, with applications 
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Figure 3 | Popularity x similarity optimization for three different 
networks. a, The growing Internet; b, E. coli metabolic network; and ¢, pretty- 
good-privacy (PGP) web of trust (WoT) between people. Each plot shows the 
probability of connections between new and old nodes, as a function of the 
hyperbolic (popularity X similarity) distance x between them in the real 
networks (circles and squares) and in PA emulations (diamonds and triangles). 
To emulate PA, new links are disconnected from old nodes to which these 
links are connected in the real networks, and reconnected to old nodes 
according to PA. For a pair of historical network snapshots Sp (older) and S; 
(newer), new nodes are the nodes present in S; but not in So, and old nodes are 
the nodes present both in S, and Sp. Each plot shows the data for two pairs of 
such historical snapshots. The solid curve in each plot is the theoretical 
connection probability in the optimization model with the parameters 
corresponding to a given real network. Because the probability of new 
connections in the real networks is close to the theoretical curves, the shown 
data demonstrate that these networks grow as the popularity x similarity 
optimization model predicts, whereas PA, accounting only for popularity, is off 
by orders of magnitude in predicting the connections between similar (small x) 
or dissimilar (large x) nodes. To quantify this inaccuracy, the insets show the 
ratio between the connection probabilities in PA emulations and in the real 
networks, that is, the ratios of the values shown by diamonds and circles, and by 
triangles and squares in the main plots. The x-axes in the insets are the same as 
in the main plots. 
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ranging from predicting protein interactions or terrorist connections 
to designing recommender and collaborative filtering systems”’. 
Second, Fig. 3 directly validates our framework and its core mech- 
anism. It is not surprising then that, as a consequence, the synthetic 
graphs that the model generates are remarkably similar to real net- 
works across a range of metrics (Supplementary Information section 
IX), implying that the framework can be also used for accurate 
modelling of real network topologies. We review related work in 
Supplementary Information section X, and to the best of our knowledge, 
there is no other model that would simultaneously: (1) be simple and 
universal, that is, applicable to many different networks, (2) have a 
similarity space as its core component, (3) cast PA as an emergent 
phenomenon, (4) generate graphs similar to real networks across a wide 
range of metrics, and (5) validate the proposed growth mechanism 
directly. Validation is usually limited to comparing certain graph 
metrics, such as degree distribution, between modelled and real net- 
works; however, this ‘validates’ a consequence of the mechanism, not 
the mechanism itself. Direct validation is usually difficult, because pro- 
posed mechanisms tend to incorporate many unmeasurable factors— 
economic or political factors in Internet evolution, for example. Our 
approach is no different in that it cannot measure all the factors or node 
attributes contributing to node similarity in any of the considered real 
networks. Yet, the angular distances between nodes in our approach can 
be considered as projections of properly weighted combinations of all 
such similarity factors affecting network evolution, and we can infer 
these distances using statistical inference methods, directly validating 
the growth mechanism. 

To summarize, popularity is attractive, but so is similarity. 
Neglecting the latter would lead to severe aberrations. Within the 
Internet, for example, a local network in Nebraska would connect 
directly to a local network in Tibet, in the same way as on the web, a 
person not even knowing about Tartini or free solo climbing would 
suddenly link her page to these subjects. The probability of such 
dissimilar connections is very low in reality, and the stronger the 
similarity forces, the smaller this probability is. Neglecting the network 
similarity structure leads to overestimations or underestimations of 
the probability of dissimilar or similar connections by orders of mag- 
nitude (Fig. 3). However, one cannot tell the difference between our 
framework and PA by examining node degrees only. The probability 
that an existing node of degree k attracts a new link optimizing popu- 
larity X similarity is exactly the same linear function of k as in PA 
(Fig. 2a). Supplementary Fig. 1 shows that this function is indeed 
realized in the considered real networks, re-validating effective PA 
for these networks. Therefore the popularity X similarity optimization 
approach provides a natural geometric explanation for the following 
‘dilemmas’ characteristic of PA. On the one hand, PA has been vali- 
dated for many real networks, while on the other hand, it requires 
exogenous mechanisms to explain not only strong clustering, but also 
linear popularity preference, and how such preference can emerge in 
real networks, where nodes do not have any global information about 
the network structure. As PA appears as an emergent phenomenon in 
the framework developed here, our framework provides a simple and 
natural resolution to these dilemmas, and this resolution is directly 
validated against large-scale evolution of very different real networks. 

We conclude with the observation that to know the closest nodes in 
the hyperbolic popularity < similarity space requires precise global 
information about all node locations. However, non-zero tempera- 
tures smooth out the sharp connectivity perimeter threshold in 
Fig. 1c, thus modelling reality where this proximity information is 
not precise and mixed with errors and noise. In that respect, PA is a 
limiting regime with similarity forces reduced to nothing but noise. 


METHODS SUMMARY 
To infer the radial r; and angular 0; coordinates for each node i in a real network 
snapshot with adjacency matrix ip i,j =1, 2, ..., t, we use the Markov Chain 


Monte Carlo (MCMC) method described in detail in Supplementary Information. 
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Specifically, we derive there the exact relation between the expected current degree 
k; of node i and its current radial coordinate r;, which scales as k; ~ e”'~". To infer 
the radial coordinates we use the same expression substituting in it the real degrees 
k; of nodes instead of their expected degrees. Having inferred the radial coordi- 
nates, we then execute the Metropolis- Hastings algorithm to find the node angular 
coordinates that maximize likelihood £=Tj<;p (xj) [1—p(xi) | 144 where 
(xi) =1/)1+e\" ~®)/T| is the connection probability in the model, and para- 
meters Rand T are defined by the average node degree and clustering in the 
network via expressions in Supplementary Information section IV. Likelihood 
L£ is the probability that the network snapshot with node coordinates (r;, 0;), 
defining the hyperbolic distances x; between all nodes, is produced by the model. 
The algorithm employs an MCMC process, which finds coordinates 0; for all i that 
approximately maximize £. Further details are in Supplementary Information 
sections II and III, where we also show that the method yields meaningful results 
for the considered networks, but not for a network (movie actor collaborations) to 
which popularity X similarity optimization does not apply. 

The nodes in Fig. 3a, b and c are respectively autonomous systems (ASs), 
metabolites and Pretty Good Privacy (PGP) certificates associating users’ email 
addresses with their cryptographic keys. Parameters (R, T) used to infer the coor- 
dinates and to draw the theoretical connection probabilities are (25.2, 0.79), (14.4, 
0.77) and (23, 0.59). Each panel of Fig. 3 shows data for two pairs of snapshots: 
Fig. 3a, January-April 2007 and April-June 2009; Fig. 3b, Sp—S; and S,—S> defined 
in Supplementary Information section I; and Fig. 3c, April-October 2003 and 
December 2005-December 2006. The few missing data points in the empirical 
curves (circles and squares) indicate that there are no node pairs at the corres- 
ponding distances after the mapping, whereas extra missing points in the PA 
emulation curves (diamonds and triangles) indicate that all node pairs at those 
distances are not connected after PA emulations, meaning that the PA connection 
probability is zero there. 
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A single-atom electron spin qubit in silicon 
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& Andrea Morello! 


A single atom is the prototypical quantum system, and a natural 
candidate for a quantum bit, or qubit—the elementary unit of a 
quantum computer. Atoms have been successfully used to store 
and process quantum information in electromagnetic traps’, as 
well as in diamond through the use of the nitrogen—vacancy-centre 
point defect”. Solid-state electrical devices possess great potential 
to scale up such demonstrations from few-qubit control to larger- 
scale quantum processors. Coherent control of spin qubits has 
been achieved in lithographically defined double quantum dots 
in both GaAs (refs 3-5) and Si (ref. 6). However, it is a formidable 
challenge to combine the electrical measurement capabilities of 
engineered nanostructures with the benefits inherent in atomic 
spin qubits. Here we demonstrate the coherent manipulation of 
an individual electron spin qubit bound to a phosphorus donor 
atom in natural silicon, measured electrically via single-shot read- 
out’. We use electron spin resonance to drive Rabi oscillations, 
and a Hahn echo pulse sequence reveals a spin coherence time 
exceeding 200 ps. This time should be even longer in isotopically 
enriched **Si samples!“!!. Combined with a device architecture!” 
that is compatible with modern integrated circuit technology, the 
electron spin of a single phosphorus atom in silicon should be an 
excellent platform on which to build a scalable quantum computer. 

There have been a number of proposals for the implementation of a 
spin-based qubit in silicon’’, though none have been studied in as much 
detail as the phosphorus atom qubit’*. This interest has been motivated 
by the knowledge, developed over half a century from electron spin 
resonance experiments on bulk-doped phosphorus in silicon’*, that 
spin coherence times can be exceptionally long, exceeding seconds”’. 
This is due to the availability of silicon in an enriched nuclear spin-zero 
(78Si) form, as well as the low spin-orbit coupling in silicon’’. The use of 
donor electron spins has further advantages of consistency (because 
each atom is identical) and tuneability (for example, through the 
Stark shift'®), and the donor atom’s nuclear spin can be employed as 
a quantum memory for longer term storage’’. 

Using methods compatible with existing complementary metal- 
oxide-semiconductor (CMOS) technology, we fabricated a nanostruc- 
ture device on the SiO, surface to enable read-out and control of an 
electron spin’? (Fig. 1a). In this work, the donor is intentionally 
implanted into the silicon substrate, with future options including 
the use of deterministic ion implantation’® or atomic precision in 
donor placement through scanning probe lithography”’. The device 
is placed in a magnetic field of approximately 1 T, yielding well-defined 
electron spin-down and spin-up states (||) and |)). 

Transitions between the electron | |) and |) states are driven by an 
oscillating magnetic field generated by applying microwaves to an 
on-chip broadband transmission line*”’. By operating at a high mag- 
netic field and low temperature (Tetectron ~ 300 MK), we can detect 
these transitions through single-shot projective measurements on 
the electron spin with a process known as spin-to-charge conversion’*. 
Here the donor electron is both electrostatically coupled and tunnel- 
coupled to the island of a single electron transistor (SET), with the SET 


serving as both a sensitive charge detector and an electron reservoir for 
the donor. Using gates PL and TG (Fig. 1a) to tune the electrochemical 
potentials of the donor electron spin states (14; and 4; for states | |) and 
|?)) and the Fermi level in the SET island (tgpy), we can discriminate 
between a ||) or |{) electron as well as perform electrical initialization 
of the qubit, following the procedure introduced in ref. 8. 

Our experiments use a two-step cyclical sequence of the donor 
potential, alternating between a spin read-out/initialization phase 
and a coherent control phase (see Supplementary Video). The qubit 
is first initialized in the ||) state through spin-dependent loading by 
satisfying the condition 1) < psp < 4; (Fig. 1b). After this, the system 
is brought into a regime where the spin is a stable qubit (1), “;¥§ << User) 
and manipulated with various microwave pulse schemes resonant with 
the spin transition (Fig. 1c). The spin is then read out electrically via 
spin-to-charge conversion (Fig. 1b), a process which produces a pulse 
in the current through the SET (that is, Isp) if the electron was |), and 
leaves the qubit initialized ||) for the next cycle. 

The electron spin resonance frequency can be extracted from the 
spin Hamiltonian describing this system (see also Fig. 1d): 


H= VeBoSz ~~ YnBolz +AST (1) 


where y, (or ’,) is the gyromagnetic ratio of the electron (or nucleus), By 
is the externally applied magnetic field, S (or I) is the electron (or nuclear) 
spin operator with z-component S, (or I,) and A is the hyperfine con- 
stant. If By >> A, the states shown in Fig. 1d are good approximations 
for the eigenstates of equation (1). Allowed transitions involving flips of 
the electron spin only (identified by arrows in Fig. 1d) exhibit resonance 
frequencies that depend on the state of the *'P nuclear spin: 
Ver ~ YeBo — A/2 for nuclear spin-down; and v.2~y.By + A/2 for 
nuclear spin-up. The transition frequencies v., and v2 are found by 
conducting an electron spin resonance (ESR) experiment’', which is 
described in the Supplementary Information. 

To demonstrate coherent control, we apply a single microwave 
pulse of varying duration ¢, to perform Rabi oscillations of the electron 
spin. For each ¢, the cyclic pulse sequence (Fig. le, f) is repeated 20,000 
times, first with a microwave frequency v.;, and immediately after at 
Veo. It is necessary to pulse on both ESR transitions as the 1 nuclear 
spin can flip several times during acquisition of the data in Fig. 2a. 
Figure 1g displays single-shot traces of the SET output current Isp for 
four consecutive repetitions of the measurement sequence, for an 
arbitrary pulse length. A threshold detection method’ is used to deter- 
mine the fraction of shots that contain a ||) electron for the measure- 
ments at both frequencies. Figure 2a shows the electron spin-up 
fraction f; as a function of the microwave pulse duration for different 
applied powers Prsr. The fits through the data are derived from 
simulations assuming Gaussian fluctuations of the local field (see 
Supplementary Information). Confirmation that these are Rabi 
oscillations comes from the linear dependence of the Rabi frequency 
with the applied microwave amplitude (Pgsp’”), that is, Frabi = YeB1- 
Here B, is taken as half of the total linear oscillating magnetic field 
amplitude generated by the transmission line at the site of the donor, 
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Figure 1 | Qubit device and pulsing scheme. a, Scanning electron micrograph 
of a qubit device similar to the one used in the experiment. The SET (lower right 
portion) consists of a top gate (TG), plunger gate (PL), left and right barrier gates 
(LB and RB) and source/drain contacts (S and D). The microwave transmission 
line is shown in the upper left portion. The donor (blue) is subject to an 
oscillating magnetic field B, from the transmission line which is perpendicular to 
the in-plane external field By. b and c, Pulse sequence for the qubit initialization, 
control and read-out. b, Read/initialization phase Wy) < spy < My: a spin-up 
electron will tunnel from the donor to the SET island, to be later replaced by a 
spin-down electron, causing a pulse of current through the SET. A spin-down 
electron remains trapped on the donor throughout the entire phase. c, Control 
phase /; /l; < Uspr: electron spin states are plunged well below the SET island 
Fermi level while microwaves are applied to the transmission line to perform 
electron spin resonance. d, Energy level diagram of the *'P electron-nuclear 
system. e and f, Microwave pulse sequence (e) and synchronized PL gate voltage 
waveform (f) for performing and detecting spin manipulations (not drawn to 
scale). An arbitrary ESR pulse sequence is represented by each of the dashed 
purple boxes in panel e. g, Example of Isp response to four consecutive read/ 
control events where a single microwave pulse of duration f, is applied, taken at 
By = 1.07 T. The pulse duration t, has been set to give a high probability of 
flipping the electron spin. The duration of the pulses in Isp gives the electron 
spin-down tunnel-in time (about 33 j1s), while their delay from the beginning of 
the read phase gives the spin-up tunnel-out time (about 295 1s). 


assuming the rotating-wave approximation. Figure 2b shows the 
expected linear behaviour with microwave amplitude of the Rabi fre- 
quencies extracted from the data in Fig. 2a. The largest Rabi frequency 
attained was 3.3 MHz (B, ~ 0.12 mT), corresponding to a 7/2 rotation 
in about 75 ns. 

The qubit manipulation time should be contrasted with the coher- 
ence lifetime of the qubit, termed T>. Possible sources of decoherence 
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Figure 2 | Rabi oscillations and power dependence of the Rabi frequency. 
a, Electron spin-up fraction as a function of the microwave burst duration for 
varying input powers Pgsr. Measurements were performed at an external field 
of By = 1.07 T where the ESR frequencies are v.; = 29.886 GHz and 

Ve2 = 30.000 GHz. Each point represents an average of 20,000 single-shot 
measurements, with each shot about 1 ms in duration (see Supplementary 
Information for further details). The solid lines are fits generated from 
simulations of the measurements (Supplementary Information). b, Rabi 
frequency versus the microwave excitation amplitude, with a fit displaying the 
linear relationship. 


include spectral diffusion of the *°Si bath spins!>???3, noise in the 
external magnetic field, and paramagnetic defects and charge traps 
at the Si/SiO, interface**. These mechanisms can, to a degree, be com- 
pensated for by using spin echo techniques (Fig. 3a), as long as the 
fluctuations are slow compared with the electron spin manipulation 
time (typically around 100 ns). 

Figure 3a presents the gate voltage and microwave pulsing scheme 
for a Hahn echo measurement. Dephasing resulting from static local 
contributions to the total effective field during an initial period 7, is 
(partially or fully) refocused by a 1 rotation followed by a second 
period t2 (see Fig. 3c for a Bloch sphere state evolution), A spin echo 
is observed by varying the delay t, and recording the spin-up fraction. 
In Fig. 3e we plot the difference in delay times (t2 — 1)) against f;. For 
T1 = Tz, we expect to recover a ||) electron at the end of the sequence if 
little dephasing occurs (that is, for short t), and hence observe a 
minimum in f;. When 12 — t; ¥ 0, imperfect refocusing results in an 
increase in the recovered spin-up fraction. The echo shape is approxi- 
mated as being Gaussian and the half-width at half-maximum implies 
a pure dephasing time of T,* = 55 + 5ns. 

We now set t = Tt, = T, and monitor the spin-up fraction as a func- 
tion of t, to obtain the spin echo decay curve of Fig. 3f. A fit of the form 
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Figure 3 | Coherence time and dynamical decoupling. a and b, Pulse 
protocols for the Hahn echo (a) and XYXY dynamical decoupling (b) sequences 
with accompanying PL gate voltage waveforms, as described in the main text. The 
rotation angles are displayed above each pulse in brackets, with the subscript (X or 
Y) denoting the axis on the Bloch sphere about which the rotation is applied. The 
read/initialization time is 1 ms. All measurements were performed at By = 1.07 T 
and with Psp = 10 dBm, where a 77/2 rotation takes around 75 ns. c and d, Bloch 
sphere representation of the evolution in the rotating frame for the Hahn echo 
(c) and XYXY (d) sequences. The green arrow represents the initial spin state | |), 
while the grey arrow represents the final state for the case when the second n/2 
pulse is about X (Y is not shown). The purple path represents dephasing in 
between pulses, the orange path represents a rotation about X, and the blue path is 
a rotation about Y. We have included rotation angle errors of 5° and 15° for the 
m/2 and pulses respectively. e, An echo curve, obtained by applying the depicted 
pulse sequence with a fixed t, (=10 Us) and varying t>. Each point represents the 
electron spin-up fraction f; calculated from 50,000 single shots acquired at both 
ESR frequencies (v.; = 29.886 GHz and v2 = 30.000 GHz) and summed. The fit 
in red is Gaussian and of the form f; = Bexp(—[(t2 — 7))/ C]’) + D.f, Hahn echo 
(or XYXY dynamical decoupling) decay in red circles (or blue squares), measured 
via simulated quadrature detection (see the Methods for details). A fit through the 
data is given by y = exp[(— (Nt/T;)°], where N = 2 (or N = 8) for the Hahn echo 
(or XYXY dynamical decoupling) experiment. Parameter values are discussed in 
the main text. 
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y = exp(—(2t/ T>)’), where T> and b are free parameters, yields 
Tz = 206 + 12 ts and b = 2.1 + 0.4. The coherence time T> is almost 
a factor of 2,000 times longer than T,*, and is remarkably close to the 
value (3001s) measured in bulk-doped natural silicon samples”. 
Variations in T> can be expected, depending on the exact distribution 
of *°Si nuclei within the extent of the donor electron wavefunction. 
This indicates that the presence of a nearby SET and the close proximity 
of the Si/SiO, interface have little, if any, effect on the electron spin 
coherence. This is not entirely surprising, because paramagnetic centres 
at the Si/SiO, interface are expected to be fully spin-polarized under our 
experimental conditions gjtpBp >> kgT (where g is the donor electron 
Landé g-factor, fg is the Bohr magneton and kg is the Boltzmann 
constant), leading to an exponential suppression of their spin fluctua- 
tions®. Direct flip-flop transitions between the donor qubit and nearby 
interface traps are suppressed by the difference in g-factor (g = 1.9985 
for the donor, g> 2 for the traps’), whereas dipolar flip-flops with 
nearby donors” can appear as a T process*® on a much longer timescale. 
We measured T, ~ 0.7s at By = 2.5 T (data not shown), implying that 
this process has no bearing on T>. The echo decay is Gaussian in shape 
(b = 2.1 + 0.4), consistent with decoherence dominated by 2°Si spectral 
diffusion”. 

We have extended the coherence time by applying an XYXY 
dynamical decoupling ESR pulse sequence* (Fig. 3b and d). This 
sequence substitutes the single m rotation of the Hahn echo with a 
series of four 7 rotations alternating about the X and Y axes, achieved 
by applying adjacent 1 pulses that are 90° out of phase. The resulting 
echo decay is shown in Fig. 3f, with a fit to the data yielding 
T> = 410 + 20 us and b = 2.1 + 0.4. As well as representing a factor- 
of-two improvement in T>, the XYXY sequence demonstrates the 
ability to perform controlled rotations about two orthogonal axes on 
the Bloch sphere (X and Y), permitting arbitrary one-qubit gates for 
universal quantum computing”. 

Next we consider the fidelity of our electron spin qubit, broken 
down into three components: measurement, initialization and control. 
The measurement fidelity Fj, comprises errors resulting from detec- 
tion limitations of the experimental set-up as well as thermally induced 
read-out events. The electrical spin-down and spin-up read errors 
(y, and y; respectively) arise from a finite measurement bandwidth 
and signal-to-noise ratio. They depend on the threshold current Ly 
used for detecting the spin-up pulses. Figure 4a shows the results of 
a numerical model based on our experimental data (see Supplementary 
Information for details), where j, ; are plotted as a function of Iy. At 
I; = 370 pA we achieve a best-case error of y = y, + y; = 18%. 

Thermal broadening of the Fermi distribution in the SET island 
produces the read/load errors, as depicted in Fig. 4b. The process of 
a spin-down electron tunnelling into an empty state in the SET occurs 
with a probability «, whereas f denotes the probability of incorrectly 
initializing the qubit in the spin-up state. The parameters « and f are 
sensitive to the device tuning and can vary slightly between measure- 
ments. We have extracted « and f from simulations of the Rabi 
oscillations in Fig. 2a, and for Pgsp = 10dBm we find « = 28 + 1% 
and f= 1+?%. This gives an average measurement fidelity for the 
electron spin-up and spin-down states of Fy = 1— (y+ «(1 —,))/ 
2=77+2% and an initialization fidelity F, of at least 90% (see 
Supplementary Information for full details). 

The qubit control fidelity Fc is reduced by random field fluctuations 
from the *’Si nuclear bath spins. These produce an effective field Begin 
the rotating frame that is tilted out of the X-Y plane (Fig. 4d), and lead 
to imperfect pulses. We now estimate the strength of these fluctua- 
tions. Figure 4c presents a series of ESR spectra, where the electron 
spin-up fraction is monitored as a function of the microwave fre- 
quency. The top three traces of Fig. 4c contain individual sweeps with 
each point obtained over a timescale of around 250 ms. We attribute 
the shift in peak position between sweeps to slow fluctuations of a few 
strongly coupled *’Si nuclei, with hyperfine coupling strengths of the 
order of 1 MHz. The width of the peaks is most probably the result of 
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Figure 4 | Qubit fidelity analysis. a, Electrical read-out errors generated from 
a numerical model. The red curve gives the error 7, involved in identifying a | |) 
electron as a function of the threshold current I;, caused by noise in Ispy 
exceeding I. The blue curve represents the error 7; for detecting a | [) electron, 
which occurs as a result of detection bandwidth limitations and a finite | [) Iser 
pulse height*. The dashed curve depicts the combined electrical error, 

y =y, +y;- b, Mechanisms by which read (top) and load (bottom) errors are 
produced as a result of thermal broadening in the SET island (discussed in the 
main text). The solid circles represent full electron states with spin indicated by 
the arrow, while the empty circles signify unoccupied states. c, Sweeps of the 
frequency Vgsp in the vicinity of the nuclear spin-up ESR transition veo. The top 
three traces are individual sweeps where f; at each Vggp is calculated from 250 
single-shot measurements. The bottom trace is an average of 100 sweeps. 

d, Illustration of the rotation errors created by hyperfine field fluctuations of the 
?°Si nuclear bath. For simplicity, only the Z-component of the hyperfine field 
has been shown. The bath nuclear spins produce an offset from resonance, AB, 
which causes rotations about a new axis aligned with Beg. 


distant, weakly coupled *’Si nuclear spins that fluctuate on the single- 
shot timescale (see Supplementary Information for further discus- 
sion). The bottom trace of Fig. 4c contains an average of 100 sweeps, 
representing many nuclear spin configurations. From this we extract a 
full-width at half-maximum Av = 7.5 + 0.5 MHz. This is consistent with 
the observed T>*, where Av = 1/(mT>*) = 6 + 1 MHz. To calculate the 
rotation angle error, we simulate a Rabi experiment assuming the largest 
B, achieved (0.12 mT) and Gaussian fluctuations of the nuclear bath with 
a standard deviation of ¢=Av/ 2/2In(2)) = 3.2 + 0.2 MHz (see 
Supplementary Information). From this we infer an average tip angle 
of 102 + 3° for an intended 7 rotation, corresponding to an average 
control fidelity of Fo = 57 + 2%. 

The processes that contribute to the measurement, initialization and 
control fidelity degradation can be mitigated with foreseeable adjust- 
ments to the device architecture and experimental set-up. Significant 
improvements in the read/load errors would follow from enhanced 
electrical filtering to lower the electron temperature, thus enabling the 
high read-out fidelities (>90%) already achieved®. Moving to an 
enriched 7*Si (nuclear spin-zero) substrate” would remove the primary 
source of rotation angle error, and allow for the exceptional coherence 
times already demonstrated in bulk-doped samples”’. 

Future experiments will focus on the coupling of two donor electron 
spin qubits through the exchange interaction", a key requirement in 
proposals for scalable quantum computing architectures in this sys- 
tem”. Taken together with the single-atom doping technologies'’*”” 
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now demonstrated in silicon, the advances reported here open the 
way for a spin-based quantum computer using single atoms, as first 
envisaged by Kane™* more than a decade ago. 


METHODS SUMMARY 

Device fabrication and experimental set-up. For information relating to the 
device fabrication and experimental set-up, see the Supplementary Information. 

Simulated quadrature detection for T, measurements. For each t (t = 1) = T2 
for the Hahn echo), the sequence of Fig. 3a (or Fig. 3b) is repeated 30,000 times (or 
75,000 times) for the Hahn echo (or XYXY dynamical decoupling) measurement at 
both v,) and v_2, and for X and Y phases of the final 7/2 rotation. The resulting signal 
amplitude is given by (f:(ve1, Y) —fi(verX)) + (f(ver Y) — fiver, X)), where 
fi (vei, Y) represents the electron spin-up fraction of the single-shot traces taken 
at v., with a final 1/2 pulse about the Y-axis, and so on. The data points in Fig. 3f 
have been re-normalized with the amplitudes and offsets extracted from free- 
exponent fits through the decays. A 30% reduction in signal amplitude was observed 
for the XYXY dynamical decoupling decay, relative to that of the Hahn echo. 
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Metazoans are likely to have their roots in the Cryogenian 
period’, but there is a marked increase in the appearance of novel 
animal and algae fossils shortly after the termination of the late 
Cryogenian (Marinoan) glaciation about 635 million years ago*°. 
It has been suggested that an oxygenation event in the wake of the 
severe Marinoan glaciation was the driving factor behind this early 
diversification of metazoans and the shift in ecosystem complexity”*. 
But there is little evidence for an increase in oceanic or atmospheric 
oxygen following the Marinoan glaciation, or for a direct link 
between early animal evolution and redox conditions in general’. 
Models linking trends in early biological evolution to shifts in 
Earth system processes thus remain controversial’®. Here we report 
geochemical data from early Ediacaran organic-rich black shales 
(~635-630 million years old) of the basal Doushantuo Formation 
in South China. High enrichments of molybdenum and vanadium 
and low pyrite sulphur isotope values (A**S values =65 per mil) 
in these shales record expansion of the oceanic inventory of 
redox-sensitive metals and the growth of the marine sulphate 
reservoir in response to a widely oxygenated ocean. The data 
provide evidence for an early Ediacaran oxygenation event, which 
pre-dates the previous estimates for post-Marinoan oxygenation’ * 
by more than 50 million years. Our findings seem to support a link 
between the most severe glaciations in Earth’s history, the oxygena- 
tion of the Earth’s surface environments, and the earliest diversifica- 
tion of animals. 

The increase of oxygen in Earth’s surface environments was pro- 
tracted and is thought to have proceeded in two major steps. The 
ocean-atmosphere system was essentially devoid of oxygen until very 
early in the Proterozoic eon, when atmospheric oxygen rose to >1% of 
the present atmospheric level (PAL)'**’. The timing and dynamics of 
this initial oxygenation are under active investigation, but there is little 
doubt that a major atmospheric transition, the “Great Oxidation Event’ 
(GOE), occurred about 2.4 billion years (Gyr) ago'*". It is commonly 
assumed that there was a second significant oxygen rise to near PAL 
during the late Neoproterozoic (~750-542 Myr ago)'*"*; however, 
the timing and magnitude of this second oxygenation event remain 
elusive?”’. 

The appearance of metazoan fossils has traditionally been used as a 
minimum estimate for the timing of the late Neoproterozoic oxygena- 
tion event’®. Molecular clock estimates place the origin of crown-group 
animals in the Cryogenian period (850-635 Myr ago)’. There are also 
sponge biomarkers* and sponge-like fossils’ in Cryogenian or older 
rocks, but these are only simple metazoans with limited oxygen demands 
(see Supplementary Information). Novel micro-and macrofossils 
interpreted as early metazoans appear immediately above Marinoan- 
age (~635 Myr ago) glacial deposits in South China** (see Supplemen- 
tary Information). Further, there is a radiation in marine algae 
following the Marinoan glaciation, which is probably linked to a 


metazoan-driven shift in trophic structure and ecosystem complexity’”. 
It has been proposed that this biological innovation is linked to an 
oxygenation event following the extensive Marinoan glaciation”. 
However, there is no direct geochemical evidence for an increase in 
ocean oxygenation in the immediate aftermath of the Marinoan 
glaciation. Most existing geochemical evidence for a late Neoproterozoic 
redox shift is much younger (~580-550 Myr old), controversial, or 
records a local shift in redox conditions that may not be globally 
representative’’*”*. 

Trace-metal enrichments in black shales can record information 
about the global ocean redox state’*'*”°. Following the establishment 
of pervasive oxidative weathering after the GOE at ~2.4 Gyr ago’*”, 
the size of the global marine reservoir of redox-sensitive elements 
(RSEs) was primarily controlled by the spatial extent of anoxic versus 
oxic marine conditions’*”. In reducing marine environments, the 
burial fluxes of many RSEs, notably molybdenum (Mo) and vanadium 
(V), exceed those in oxygenated settings by several orders of magnitude. 
Hence, it follows that when oxic conditions are more widespread, the 
global seawater concentrations of these RSEs will be higher'®. For 
example, in today’s predominantly oxygenated oceans, Mo is the most 
abundant transition metal (~105nM; ref. 18), despite its very low 
crustal abundance (~1 p.p.m,; ref. 21). Because the residence time of 
Mo (800-440 kyr)'*” and V (~50kyr)’® in seawater is much longer 
than the ocean mixing time (~1.5 kyr), Mo and V in marine basins 
track the global average conditions'**°”*. This concept is well- 
grounded in an understanding of the modern Mo and V global mass 
balances'*””. 

The magnitude of RSE enrichments in anoxic marine sediments 
reflect dissolved RSE concentrations in seawater”®”’. In general, RSE 
enrichments also scale with the organic carbon flux because reduced, 
particle-reactive metal species (for example, thiomolybdate or vanadyl 
ions) are bound by organic particles, resulting in strong correlations 
between the metal and total organic carbon (TOC) contents in 
modern anoxically deposited sediments and black shales’. Therefore, 
authigenic enrichments are commonly normalized to TOC concentra- 
tions (for example, Mo/TOC and V/TOC)*. RSE enrichments in 
anoxic shales may also depend on other factors, such as sedimentation 
rates and sulphide levels. However, studies in modern anoxic basins 
with access to the open ocean’* *°”* have shown that the dissolved metal 
concentrations exert a first-order control on the degree of the enrich- 
ment (see Supplementary Information). This relationship sets the stage 
for using RSE enrichments in anoxic shales to track Earth’s oxygenation 
history and specifically the extents of anoxia in the ocean”’*”*. 

The temporal record of RSE enrichments in shales deposited beneath 
an anoxic water column currently provides the least controversial and 
most direct geochemical signal for a significant redox shift in the late 
Neoproterozoic”’*. However, the existing data only show a jump in 
RSE enrichments to Phanerozoic values near the end of the Ediacaran 
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period (~551 Myr ago)”"*4, long after the radiation of complex 
metazoans including triploblastic animals*°’°. There is a pronounced 
data gap between this time and 663 Myr ago in previous analyses due to 
the lack of suitable open-marine, deep-water black shale samples””’. 
Here we fill this gap by analysing organic-rich black shales of the basal 
Doushantuo Formation in South China that were probably deposited 
between 635 and 630 Myr ago (see Supplementary Information). We 
found very high, Phanerozoic-like RSE enrichments within a few 
million years after the Marinoan glaciation, coincident with the 
appearance of the earliest non-poriferan metazoan fossils*”. 

In South China, Marinoan-age glacial diamictites of the Nantuo 
Formation are overlain by a 3-to-6-m thick, 635.2 + 0.6 Myr old”® 
cap carbonate commonly referred to as Member I of the Doushantuo 
Formation (Fig. 1)”. The cap carbonate is conformably overlain by 
organic-rich black shales (Member II of the Doushantuo Formation), 
with subordinate carbonate layers and phosphorite-chert nodules in 
shelf and upper-slope environments” (see Supplementary Informa- 
tion). Putative metazoan fossils were found 6 m above the Doushantuo 
cap carbonate in the Yangtze Gorges area*’, less than two metres 
above an ash bed that has been dated as 632.5 + 0.5 Myr old”. 
Morphologically complex macroscopic fossils of the Lantian biota® 
were reported from slope-basinal black shales, approximately 15m 
above the cap carbonate (Fig. 1). Our geochemical analyses focus on 
the basal Member II black shales, which are roughly equivalent or 
slightly below these fossil horizons. 

The basal Member II black shales have a transitional contact with the 
underlying cap carbonate’*. Their widespread occurrence across the 
basin and lack of event beds (for example, turbidites and olistostromes) 
suggest that they were deposited during the latest stage of postglacial 
transgression or sea-level highstand, with the shelf-to-basin topography 
inherited from the Cryogenian” (see Supplementary Information). We 
focused on deepwater Member II samples collected from three sections 
in an open-marine, slope/basinal setting (Fig. 1 and Supplementary 
Fig. 1). The black shales in the lower slope and basin sections contain 
an average of ~2.0 wt% of both pyrite iron (Fepyrite) and total organic 
carbon (TOC). The upper slope section has comparatively lower 
Fepyrite (~0.3 wt% on average) and higher TOC (2.3 wt% on average); 
see Supplementary Table 1. 

Slope and basin post-glacial black shales of Member II show 
strong Mo and V enrichments (Fig. 2). Mo enrichments in these 
sections commonly exceed the average Phanerozoic euxinic (anoxic 
and sulphidic) shale concentration of ~ 100 p.p.m. and the Phanerozoic 
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average Mo/TOC ratio (p.p.m./wt%) of ~25 (refs 9, 13). These high Mo 
enrichments show that the local water column was not only anoxic but 
also had free dissolved sulphide (euxinic conditions)”. Vanadium 
enrichments of several thousand p.p.m. are common and equivalent 
to the largest values observed in the Phanerozoic anoxic shale record’ 
(Fig. 3). Similarly, uranium (U) in the lower Doushantuo Formation 
(Fig. 2) exceeds 30 p.p.m.—equivalent to the largest enrichments 
observed in modern anoxic basins”. Therefore, the magnitude of 
RSE enrichments used previously to argue for a late Neoproterozoic 
oxygenation event at ~551 Myr ago””* are now found in >630-Myr- 
old, early Ediacaran black shales (Fig. 3). It is well established that basin 
restriction, if anything, decreases the extent of RSE enrichment””’, 
ruling out the possibility that the large enrichments have only local 
significance (see Supplementary Information). Also, other factors, 
such as persistent or transient occurrences of oxygen in overlying 
waters, consistently mute the magnitude of RSE enrichment”. 
Hence, the large RSE enrichments in the lower Doushantuo 
Formation provide strong evidence for a significant global marine 
oxygenation event in the aftermath of the Marinoan glaciation. 
Simple mass-balance calculations confirm that the observed Mo and 
V enrichments in the lower Doushantuo Formation require a well- 
oxidized ocean (see Supplementary Information). Building from the 
global Mo and V cycles’*"”, it is possible to estimate the effects of 
increasing riverine RSE flux on the marine Mo and V reservoirs. 
Even with a very elevated riverine flux of Mo and V, the marine redox 
landscape will control RSE reservoir sizes and thus RSE enrichments. 
For instance, based on mass-balance calculations, even a doubling of 
the riverine Mo or V flux can be compensated for by relatively small 
growth in the extent of euxinia (for example, 1-2% of the seafloor area; 
see Supplementary Fig. 3). Although continental Mo and V fluxes may 
have been elevated for short periods (thousands of years) during the 
most rapid stages of glacial retreat, glacially elevated fluxes are not 
expected during deposition of the examined black shales, which span 
a period of millions of years after glacial retreat. Mass-balance calcula- 
tions also indicate that even a relatively small areal increase (for 
example, 2-3%) in anoxic seafloor would crash the Mo and V reservoirs 
on a timescale of =0.5 Myr (see Supplementary Information). Thus, 
the observed Mo and V enrichments in lower Doushantuo Formation 
black shales suggest that oxic waters bathed the vast majority of the 
ocean. Oxygen-deficient conditions must have been spatially limited 
and only located where high organic matter loading caused oxygen 
depletion along ocean margins, such as in the Nanhua basin. 
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Figure 1 | Locality maps and stratigraphy. a, Simplified geological map with 
location of sections (1, Taoying; 2, Wuhe; 3, Yuanjia). The dashed red line 
marks the shelf margin during deposition of the Doushantuo Formation. 
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b, Palaeogeographic model for the Doushantuo Formation (simplified from ref. 
27). ¢, Stratigraphy of the lower Doushantuo Formation with radiometric ages” 
and important fossil horizons** marked. 
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Figure 2 | Trace-metal abundances (Mo, Mo/TOC, V, V/TOC, U, U/TOC) 
and pyrite sulphur isotopes (5° 4S pyrite) from the lower Doushantuo 
Formation black shales. The shape of the data symbols indicates sample 
locations: Taoying (upper-slope section; squares), Wuhe (lower-slope section; 
diamonds), and Yuanjia (basin section; circles). The colour of the data symbols 
is used to distinguish sulphidic (Fe,,,ite/Feyr > 0.8; grey) from non-sulphidic 
(Fepyrite/Ferr < 0.8; light green) shales (Supplementary Fig. 2). Because a large 


Large variability in the magnitude of RSE enrichments in lower 
Doushantuo Formation black shales suggests locally variable redox 
conditions. There are intervals with high RSE enrichments adjacent 
(<0.2 m) to intervals with near-crustal, un-enriched RSE concentra- 
tions, despite apparently constant physical conditions (that is, absence 
of wave-, storm- and gravity-generated beds) well below the storm 
wave base (see Supplementary Information). These variations suggest 
significant redox shifts in environments and thus deposition of the 
units under mixed oxic, suboxic-ferruginous, and euxinic conditions. 
Redox fluctuations in deep-water environments well below the mixed 


portion of the Doushantuo samples have unusually low total iron (Fey)/ 
aluminium (Al) ratios (Supplementary Table 1), we do not use highly reactive 
iron (Feyg)/total iron (Fer) ratios as a quantitative palaeoredox proxy (see 
Supplementary Information). The dashed line in the 8S pyrite panel marks the 
average 5°“Scas value of the analysed stratigraphic interval“. Sulphur isotope 
data are reported as per mil (%o) deviations from the isotope composition of 
Vienna Canon Diablo Troilite (VCDT). 


layer, including those at an estimated depth of >1,000 m in the basinal 
section’”** (see Supplementary Information), are consistent with a 
well-oxygenated global ocean—atmosphere system. 

All Mo, V and U concentrations shift to lower values in the upper part 
of the basal Doushantuo black shales (Fig. 2). Given the observed vari- 
ability in RSE enrichments within the basal unit, this drop in RSE enrich- 
ments may record a local environmental shift towards less reducing 
conditions that are not conducive to redox-sensitive metal sequestration 
(see Supplementary Information). Alternatively, the decrease of RSE 
values may record a shift back to more reducing ocean conditions 
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Figure 3 | Summary of redox-sensitive trace elements and evolution of the 
ocean-atmosphere redox state. a, Atmospheric oxygen levels compared with 
present atmospheric level (PAL). The early Ediacaran metazoans*° appeared 
shortly after the Marinoan glaciation, during an increase in ocean—atmosphere 
oxygen level (modified from ref. 8). b, Temporal trends in V enrichments 
(crosses) and V/TOC ratios (circles) in euxinic black shales (Supplementary 
Tables 2 and 3). c, Temporal trends in Mo enrichments (crosses) and Mo/TOC 
ratios (circles) in euxinic black shales (Supplementary Tables 2 and 3). The spikes 
of V, V/TOC, Mo and Mo/TOC from the lower Doushantuo black shales (this 
study) are marked. d, Potential shift in nutrient-limiting factor from trace metals 
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to phosphorus after the Marinoan glaciation***. Data compilation, stratigraphic 


and chronological details are provided in the Supplementary Information. The 
Mo and V records indicate that the ocean experienced a late Neoproterozoic 
growth in the inventory of redox-sensitive trace elements, which can be linked to 
ocean ventilation. This redox shift was previously thought to have occurred at 
~551 Myr ago”"’. Our study provides evidence for an oxygenation event 
associated with increased seawater Mo-V enrichments and Mo/TOC and 
V/TOC ratios in the aftermath of the Marinoan glaciation, synchronous with or 
slightly pre-dating the earliest non-poriferan metazoan fossil record at ~632 Myr 
ago. 
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typical of the mid-Proterozoic (~1,800-700 Myr ago)—implying that 
the early Ediacaran oxygenation was not a unidirectional process (see 
Supplementary Information). However, this possibility requires a 
more comprehensive test in broader palaeogeographic settings of the 
Nanhua basin and in other global successions. 

Highly negative pyrite sulphur isotope (O”'Sisyiite) values down to 
-35%o from the basinal samples (Fig. 2; Supplementary Table 1) 
further support our interpretation of an oxygenated ocean—atmosphere 
system following Marinoan deglaciation. Coeval (~635-630 Myr) 
carbonate-associated sulphate sulphur isotope (3°4Scas) values from 
shelf sections*** have an average of ~34%o. In this framework, the 
calculated isotope fractionation between pyrite and coeval sulphate 
(A**S) in the deep basin section is >65%o—equivalent to maximum 
fractionations by modern sulphate reducing bacteria*® and the 
maximum pyrite-coeval sulphate offset observed in the Phanerozoic 
rock record®*' (Supplementary Fig. 4). An increase in the isotopic off- 
set between pyrite and seawater sulphate in the late Neoproterozoic has 
been commonly linked to growth of the global marine sulphate 
reservoir and surface oxidation””". The large sulphur isotope fractiona- 
tion in the basal Member II shales therefore points towards a perva- 
sively oxygenated ocean with lower associated pyrite burial, consistent 
with the conclusion drawn from RSE enrichments. 

The shift to pervasively oxygenated oceans and, by inference, higher 
atmosphere O, conditions following the Marinoan glaciation may be 
linked to high nutrient availability*. It has been proposed that there 
was a large marine phosphate reservoir during and following the 
Cryogenian glacial events”*. Phosphate is commonly considered as 
the ultimate limiting nutrient on geological timescales, as nitrogen 
(N) can be supplied by biological nitrogen fixation from an essentially 
limitless atmospheric supply. Therefore, glacially induced perturba- 
tions to the phosphorus cycle may have triggered an organic carbon 
burial event”® that induced a shift towards higher oxygen levels. 

This model, however, overlooks the potential for persistent N stress 
linked to trace-metal biolimitation**. In a broadly anoxic ocean, 
chalcophilic trace metals (for example, Mo, Cu, Cd) may be co-limiting 
nutrients and inhibit efficient N fixation—just as in the modern oxic 
marine system Fe stress limits N fixation over broad swaths of the 
ocean”’. Significant Mo biolimitation may have limited the organic 
carbon production and stabilized the redox state of the oceans through 
the mid-Proterozoic*. In this light, a jump in sulphidic black shale Mo 
enrichments at the Cryogenian-Ediacaran transition probably records 
a shift in nutrient regimes as well as in global redox conditions. 

Marine chalcophilic trace-metal levels following glacial retreat must 
have been sufficient to allow for significant organic carbon burial and 
the switch to a more pervasively oxygenated ocean. Dissolved marine 
Mo may have built up to the critical levels needed to move out of a 
marine system with chronic Mo-induced N stress during glaciation. A 
brief episode (for example, thousands of years) of increased Mo 
delivery associated with high rates of postglacial weathering before 
the deposition of the Doushantuo Member II shales may have also 
helped to provide the perturbation needed to overcome persistent Mo 
biolimitation. Mitigating coupled Mo-N stress would have allowed for 
efficient utilization of a large marine phosphate reservoir’*— 
ultimately promoting organic carbon burial and oxygen release. 
Ventilation of the ocean would lead to significant growth of the Mo 
reservoir (and other RSE reservoirs) as manifest in the Member II 
shales deposited several million years after glacial retreat. The formerly 
enigmatic association of the ‘Snowball Earth’ glaciations and early 
metazoan and algal diversification could thus be linked to shifts in 
nutrient availability, burial of organic carbon, and ultimately the shift 
to a more oxygenated ocean-atmosphere system that favoured the 
early diversification of metazoan life and ecosystems. 
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Response of salt-marsh carbon accumulation to 


climate change 


Matthew L. Kirwan! & Simon M. Mudd? 


About half of annual marine carbon burial takes place in shallow 
water ecosystems where geomorphic and ecological stability is 
driven by interactions between the flow of water, vegetation growth 
and sediment transport’. Although the sensitivity of terrestrial and 
deep marine carbon pools to climate change has been studied for 
decades, there is little understanding of how coastal carbon accu- 
mulation rates will change and potentially feed back on climate””. 
Here we develop a numerical model of salt marsh evolution, 
informed by recent measurements of productivity and decomposi- 
tion, and demonstrate that competition between mineral sediment 
deposition and organic-matter accumulation determines the net 
impact of climate change on carbon accumulation in intertidal 
wetlands. We find that the direct impact of warming on soil carbon 
accumulation rates is more subtle than the impact of warming- 
driven sea level rise, although the impact of warming increases with 
increasing rates of sea level rise. Our simulations suggest that the 
net impact of climate change will be to increase carbon burial rates 
in the first half of the twenty-first century, but that carbon-climate 
feedbacks are likely to diminish over time. 

Salt marshes, mangroves and seagrass meadows are among the most 
valuable ecosystems in the world, yet vulnerable to climate change and 
direct anthropogenic disturbance***. Understanding how coastal 
ecosystems and their carbon pools react to climate change and sea level 
rise is difficult because landscape-forming processes and biological 
processes work on similar timescales and are thoroughly intertwined. 
For example, vegetation growth depends on physical conditions such as 
sea level rise and inundation duration, but can also mediate physical 
conditions because plants influence their elevation by accreting peat 
and trapping mineral sediment®. Rates of carbon accumulation are 
therefore dependent on ecosystem engineering processes, and in some 
cases, determine whether a marsh survives sea level rise. 

Recent work suggests that some individual aspects of global change 
may increase the resiliency of coastal wetlands facing sea level rise and 
their ability to sequester carbon. For example, increased CO2, warmer 
temperatures, and moderate increases in rates of sea level rise all tend to 
increase rates of plant productivity, marsh accretion, and presumably 
the ability of marshes to survive sea level rise”"’*. However, accelerated 
sea level rise also leads to more mineral sediment deposition, which 
influences the elevation of coastal wetlands in ways that might limit 
their ability to sequester carbon'*"*. Increases in decay rates may 
accompany increased CO, and warmer temperatures'*’°. Finally, the 
accumulation rate of labile organic matter tends to equilibrate through 
time because decay rates are proportional to the size of the active 
carbon pool’*. Here we model these ecogeomorphic feedbacks to 
explore how one of the most complex carbon sinks responds to 
interactive components of global change including temperature warm- 
ing and sea level rise. 

Our numerical model couples two previous models that separately 
simulated the influence of vegetation on the deposition of mineral 
sediment” and the interactions between inundation, root zone pro- 
cesses and carbon accumulation’. In our model, inundation stimulates 


plant growth up to a threshold water depth, as observed in long-term 
records of salt marsh biomass and field experiments”’*’’. Increased 
temperatures tend to increase rates of productivity and decay in the 
model according to sensitivities determined from a latitudinal gradient 
and field experimentation’®'®. Organic matter produced by root 
growth is distributed throughout the soil profile when productivity 
exceeds decay, and changes in soil volume (that is, marsh elevation) 
affect both mineral and organic components of accretion. Whereas 
previous modelling efforts focused on the influence of mineral 
sedimentation on marsh evolution, we deliberately chose parameter 
values (sediment grain size, suspended sediment concentrations) that 
allowed us to explore the evolution of organic-matter-rich marshes 
dominated by root-zone processes (Table 1). In the discussion below, 
we refer to organic-matter accumulation and carbon accumulation 
interchangeably, given that the two are intrinsically linked’*. 

We first performed model experiments that explored how competi- 
tion between enhanced productivity and decay associated with warmer 
temperatures determine carbon accumulation rates. We took a 
hypothetical marsh dominated by the macrophyte Spartina alterniflora 
in equilibrium with a slow, constant rate of sea level rise (1 mm yr’), 
and subjected it to an instantaneous 4°C increase in temperature. 
After the temperature change, we found that rates of organic-matter 
decomposition exceeded rates of root production, leading to a slight 
decrease in organic-matter accumulation (Fig. la). Concurrently, 
warming enhanced aboveground productivity and its ability to trap 
mineral sediment, leading to a small increase in vertical accretion rate 
and marsh elevation relative to sea level (Fig. 1b). When temperature 
was perturbed under a higher rate of sea level rise (3mmyr '), 
ambient plant productivity was relatively high, so that warming led 
to an increase in productivity that was greater than the increase in 
decay rate. Thus, rates of carbon accumulation and vertical accretion 
both increased after the step change in temperature (Fig. lc, d). 
Accretion rates were momentarily higher than the rate of sea level rise, 
so the marsh platform built up elevation relative to sea level, and 
reduced inundation led to progressive reduction of plant productivity 
and vertical accretion. Consistent with a variety of numerical models of 
salt marsh evolution’, vertical accretion rates decreased until they 
approached a new equilibrium with the rate of sea level rise. Our results 
predict that the new equilibrium condition (under a moderate rate of 
sea level rise and warmer temperature) will be characterized by a higher 
elevation of the platform relative to sea level, a small increase in the rate 


Table 1 | Parameter values used in model simulations 


Parameter Value 
Sediment concentration, C 1 mgl? 
Particle diameter 0.03 mm 
Maximum biomass, B 2,500gm * 
Fast decay, k; 2yrt 

Slow decay, k, 0.001 yr? 
Spring tidal range 14m 


Parameters not listed have values identical to those in refs 12 and 16. 


1Department of Environmental Sciences, University of Virginia, PO Box 400123, Charlottesville, Virginia 24151, USA. 2School of GeoSciences, University of Edinburgh, Edinburgh EH8 9XP, UK. 3Earth 
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Figure 1 | Response of organic-matter accumulation and vertical accretion 
rates to an instantaneous 4 °C change in temperature at year 0. 

a, b, 1mm yr | rate of sea level rise. c, d. 3 mm yr rate of sea level rise. The 
solid line denotes the model with temperature influence on productivity and 
decay, and the dashed line denotes the model with no effect of temperature on 
decay. The sharply negative accretion and organic accumulation rates at year 0 


of plant productivity and organic-matter accretion, but no change in 
vertical accretion rate. 

This simple experiment provides two important insights into carbon 
dynamics in salt marshes. First, it illustrates that if plant productivity 
drives changes in wetland elevation, then the results of short-term 
experiments are likely to overestimate the long-term effect of climate 
change on carbon burial and vertical accretion. Field and laboratory 
experiments, for example, demonstrate that increased CO, leads to an 
increase in productivity and marsh accretion*’. Our model results are 
consistent with these observations because enhanced productivity via 
warming also leads to an increase in marsh accretion rate. The response 
was most pronounced when the effect of temperature on decomposi- 
tion was not modelled (dashed line in Fig. 1b, d), analogous to 
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Figure 2 | Impact of warming under constant sea-level-rise rates. a, Net 
impact of warming on organic-matter accumulation rate defined as the 
deviation from ambient accumulation rate 100 years after the step change in 
temperature modelled in Fig. 1. The net influence of warming is negative at low 
rates of sea level rise, but positive at high rates of sea level rise. For experiments 
with different decay coefficients see the Supplementary Information. b, Impact 


should be considered a model artefact arising from enhanced decay that takes 
place immediately, whereas enhanced productivity takes one year to reach the 
soil profile in the model. Enhanced productivity and decay have no long-term 
effect on accretion rate (b, d), but have moderate impacts on organic-matter 
accumulation (a, c). 


increased-CO, experiments that do not warm soil temperatures’. 
However, our experiments indicate that the enhanced vertical 
accretion rate quickly returns to the rate of sea level rise because the 
progressive building of wetland elevations ultimately limits plant pro- 
ductivity. At the same time, losses of labile soil carbon increase in 
proportion to the size of the growing carbon pool. Together, decreasing 
rates of plant productivity and increasing rates of decay cause carbon 
burial rates to decline after their initial increase (Fig. 1d). 

A second implication of our results is that the rate of sea level rise, 
and its influence on ambient plant productivity, determines the net 
response of organic accumulation rates to climate change. Warming 
leads to reductions in organic matter accumulation at low rates of sea 
level rise (Fig. 1a), while warming leads to increases in organic-matter 
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of warming on the fraction of organic material deposited at the soil surface. 
Warming leads to an increase in mineral content at low rates of sea level rise, 
but an increase in organic content at high rates of sea level rise. In general, 
warming temperatures and accelerating sea-level-rise rates lead to more 
organic-enriched soils until the marsh submerges (4.5mm yr’ in this 
experiment). 
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Figure 3 | Impact of warming under accelerating sea-level-rise rates. 

a, Response of organic-matter accumulation rate to temperature-driven 
changes in productivity, decay and sea-level-rise rates over the next century. 
b, Response of accretion rate to the same warming and sea-level-rise scenario. 
For most of the twenty-first century, plant productivity and organic-matter 
accumulation rates increase with progressive inundation of the marsh surface. 


accumulation at high rates of sea level rise (Fig. 1c). Although the net 
impact of warming declines through time in either case, discernible 
differences in carbon accumulation persist for centuries. Figure 2 
summarizes the impact of warmer temperature on long-term 
(100-year) carbon accumulation rates over a variety of sea-level-rise 
rates. Vertical accretion rates equilibrate to the rate of sea level rise 
within 100 years, so the response of carbon accumulation rates to 
warming is driven by competition between organic-matter production 
and mineral sediment trapping efficiency. More productive plants in a 
warmer climate can have a positive influence on either process. At low 
rates of sea level rise, the marsh starts high in the tidal frame with little 
productivity. Enhanced productivity associated with warming leads to 
more inorganic trapping at the expense of organic accumulation 
(Fig. 2a), and marsh soils become more mineral-rich (Fig. 2b). At high 
rates of sea level rise, the marsh starts low in the tidal frame with high 
plant productivity and high organic-matter accumulation rates. 
Enhanced productivity leads to an increase in organic accumulation 
at the expense of inorganic sediment trapping (Fig. 2a), and the marsh 
becomes more organic-matter-rich (Fig. 2b). Thus, the effect of 
increased productivity on organic-matter accumulation depends on 
the background rate of sea level rise, and the elevation of the marsh 
relative to sea level. These processes suggest that predicting carbon 
cycling in rapidly accreting landscapes such as deltas, floodplains and 
wetlands requires knowledge not just of how plants respond to climate 
change, but also of how vegetation and geomorphic processes interact. 

Given that sea-level-rise rates influence the net impact of warming 
on carbon accumulation, but are themselves a function of temperature, 
we considered a final experiment that simulated the evolution of a 
hypothetical marsh over the next century in which these variables 
interact (Fig. 3). Temperature projections followed the IPCC A2 
scenario’’ (an increase of 3.7 °C by 2100), and drove the rate of sea 
level rise according to a second-order semi-empirical relationship”. 
The experiment began with a 100-year spin-up period in which carbon 
pools and marsh elevations grow and equilibrate with the historical 
rate of global sea level rise, which we take to be 1.7 mm yr! (ref. 19). 
Although plant productivity accelerated with inundation and temper- 
ature throughout the experiment, more complex behaviour related to 
the influences of temperature and carbon pool size on decay rates 
emerged late in the simulation. As in Fig. 2, warmer temperatures 
led to little change in the organic accumulation rate when the rate of 
sea level rise was 1.7 mmyr |. As sea level rise accelerated, and 
ambient productivity increased owing to relative marsh lowering, 
the effect of enhanced productivity became more pronounced. For 
most of the twenty-first century, warmer temperatures and faster 
sea level rise rates led to an increase in organic-matter production, 
carbon accumulation, and vertical accretion. This increasing organic 


552 | NATURE | VOL 489 | 27 SEPTEMBER 2012 


2000 2020 2040 2060 2080 2100 


6 With warming ] 
<c 4 Ambient ] 
ae conditions | 
Ee 
£ 0 
oO 
g -2 
< -4 
2 
3 -6 
8 -8 
<x 


! 
2000 2020 2040 2060 2080 2100 
Year 


The abrupt decline in organic-matter accumulation and marsh accretion near 
2085 is due to plant mortality as plants cross an inundation threshold. At this 
point, carbon ceases to be deposited into marsh sediment, even as existing 
carbon near the soil surface continues to decay. For alternative climate 
scenarios and the impacts of varying suspended sediment concentrations, see 
the Supplementary Information. 


production led to a growing carbon pool, and with it a growing loss of 
labile organic matter through decay. Eventually the combined effects 
of a larger carbon pool and enhanced decay outstripped organic pro- 
duction, leading to decelerating rates of organic accumulation after 
2050 and declining rates of organic accumulation after 2075. Although 
enhanced plant productivity facilitated faster mineral sediment 
deposition, root zone processes dominated elevation change in this 
organic-matter-rich marsh, and vertical accretion rates also declined 
after 2075. With an accelerating rate of sea level rise and a declining 
rate of vertical accretion, the hypothetical marsh quickly lost elevation 
relative to sea level and vegetation drowned around 2085. 

For more than 30 years, the debate on whether carbon stored in 
ecosystems will exacerbate or mediate climate change has focused on 
the biogeochemical evolution of terrestrial and open-ocean systems”’”’. 
However, coastal ecosystems such as salt marshes have only recently 
been recognized as important components of the global carbon cycle””. 
Our results illustrate potential feedbacks between climate and carbon in 
the rapidly evolving coastal zone. We find that the net impact of tem- 
perature warming and sea level rise is to increase carbon burial rates 
in the first half of the twenty-first century (negative feedback with 
climate), but to slightly decrease carbon burial rates in the second half 
of the century (positive feedback with climate). In contrast, terrestrial 
ecosystems are likely to absorb a progressively smaller fraction of 
anthropogenic carbon dioxide emissions through the twenty-first 
century****. Thus, transient increases in carbon burial rates near the 
coast could potentially help offset the reduced uptake of carbon from 
terrestrial ecosystems. Interestingly, our model results suggest that a 
switch in feedback direction takes place even in marshes that are 
surviving sea level rise, and where rates of plant productivity accelerate 
in response to climate change. If sea level rises by more than a metre by 
2100, most models of marsh evolution indicate that marshes will be 
submerged and lose productivity. Our model considers only the 
temporal evolution of a single point within a hypothetical marsh. If 
the spatial extent of global marshes were to decline owing to sea level 
rise or anthropogenic modifications, then our prediction of long-term 
declining carbon burial rates would be amplified. 


METHODS SUMMARY 


To explore the temporal evolution of a hypothetical salt marsh platform, we 
coupled a model that simulates the influence of vegetation on the accumulation 
of mineral sediment’’ with a model that focuses on interactions between inunda- 
tion, root zone processes, and carbon accumulation’. The model is generally 
suited to represent salt marshes in protected microtidal-mesotidal estuaries, most 
explicitly those colonized by Spartina alterniflora, the dominant macrophyte in 
North American marshes. In the model, inundation stimulates plant growth up to 
a threshold water depth”’~’’ and increased temperatures tend to increase rates of 
productivity and decay’®’®. Standing biomass varies throughout the growing 
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season, and affects marsh accretion by trapping sediments during periods of marsh 
inundation and by depositing belowground carbon owing to root mortality. 
Belowground organic-matter production is estimated from aboveground biomass 
according to a depth-dependent root-to-shoot ratio'*. Compaction is computed 
on the basis of sediment overburden and organic content’. Dead biomass decays 
according to a linear decay model, where decay is influenced by the concentration 
of organic carbon, its depth below the soil surface, and temperature. For simplicity, 
we consider only two pools of soil carbon in our simulations: a quickly decaying 
pool of organic matter that reflects the decomposition of cellulose and amino-acids 
(k,), and a slowly decaying pool that reflects the decomposition of more refractory 
materials such as lignins (k,). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Vegetation modelling. To explore the temporal evolution of a hypothetical salt 
marsh platform, we coupled a model that simulates the influence of vegetation on 
the accumulation of mineral sediment’’ with a model that focuses on interactions 
between inundation, root zone processes, and carbon accumulation’*. The model 
is generally suited to represent salt marshes in protected microtidal-mesotidal 
estuaries, most explicitly those colonized by Spartina alterniflora, the dominant 
macrophyte in North American marshes. In the model, inundation stimulates 
plant growth up to a threshold water depth”'*"’ and increased temperatures tend 
to increase rates of productivity’. Between a minimum and maximum depth, 
measured below mean high high water (MHHW), the biomass during the peak 
growing season is: 


Bmax 
Dmax = Dyin 


(Dimensions are denoted in square brackets of [M] for mass, [L] for length and [T] 
for time). Here, Dyin [L] and Dyax [L] are the minimum and maximum depths 
below MHHW, B, ax is the peak season biomass [M L~7], Bmax IM L~?] is the 
biomass at the optimal depth below mean high tide, AT is the change in temper- 
ature from a reference temperature, and Opisa factor that accounts for the increase 
in biomass due to an increase in temperature. From a latitudinal gradient of 
S. alterniflora productivity in 56 North American marshes'’, we determine that 
x ~ 0.06 °C” |, in general agreement with experimental warming studies that also 
demonstrate a positive response to warming'’’®’’. The time-varying standing 
biomass B [M L~”] is calculated on the basis of on a sinusoidal approximation 
of the annual variation in marsh productivity (B fluctuates between 0 and B, 
during the year)'*. Standing biomass can then affect marsh accretion by trapping 
sediments during periods of marsh inundation and by depositing belowground 
carbon owing to root mortality. 

Sediment trapping. We model the trapping of mineral sediment as the sum of 
plant-mediated particle settling Q, [ML *T” ']) and direct capture of sediment by 
plant stems Q. [ML~? T~'] using the method of ref. 16. In S. alterniflora marshes, 
the mass flux from plant mediated settling is much greater than that from direct 
particle capture under typical flow conditions (<10cms_') (ref. 17). The particle 
settling flux is: 


Bpeak (D—Dpin)(1+4T og) (1) 


Q= (ws — Wap) C (3) 


where C [M L °] is the suspended sediment concentration, w, [L T~'] is the 
settling velocity of particles in still water and wy, [L T  ‘] is the upward velocity 
of particles due to turbulence. The upward velocity of particles is: 


0.2002u2(Cpa.d-)/ 
Wup = Kv 4,4" (Coacde) (4) 
Pw 


where u [LT *] is the flow velocity, K,x is von Karman’s constant, assumed to be 
0.4, 04. is a coefficient reported to be 0.9 (ref. 28) and Cp is the depth-averaged drag 
coefficient. a, [L~"] is the projected plant area per unit volume, calculated as 
a, = 4B, and d, is the stem diameter calculated by d, = 1B’. The drag coefficient 
is calculated using: 


ov OL[Te 
Cy =2( 8 +47 at?) (5) 
where y and ¢ are empirical coefficients reported to be 0.46 + 0.11 and 3.8 + 0.5, 
respectively”. During the rising limb of the tide we make the simplification that 
sediment concentrations are fixed owing to a supply of sediment-laden water from 
tidal creeks, and during the falling limb of the tide suspended sediment concen- 
trations are calculated by solving the sediment continuity equation'’. 

Compaction plays a significant part in determining the absolute elevation of 
marsh surfaces because the sediments that make up salt marsh soils are compressible. 
Here we use a constitutive equation verified by laboratory experimentation to model 
the compaction of the vertical soil profile. Following ref. 12: 


B=E)~Cllog(“) 
90 


where E [dimensionless] is the void ratio, CI [dimensionless] is the compression 
index, Eo [dimensionless] is the void ratio at the reference stress, ¢) [MT *L ‘], 
andoeg (MT *L '] isthe effective stress. Following ref. 12, we assume the column 


is under hydrostatic pressure and that the effective stress at any depth below the 
surface is simply the buoyant weight of the material above it. The compression 
index and reference void ratio depend on the substrate, where organic material in 
the modelled soil column is far more compressible than inorganic material’. 
Organic-matter production. Carbon is deposited in the subsurface via root 
mortality. The belowground biomass By,, [M L *] is related to the aboveground 
biomass through: 

=D + Deas (6) 
where ,, and Dnpm are the slope [L~?] and the intercept [dimensionless] of the 
relationship between the roots-to-shoots ratio and the depth below MHHW. 
These parameter values were determined from field experiments that examined 
the change in roots-to-shoots ratio with inundation for S. alterniflora’’. 

The difference between growth rate and mortality rate determines the change in 
standing biomass through the year. Mudd et al.’ approximated the measured 
growth of S. alterniflora as a sinusoidal function based on field data from North 
Inlet, South Carolina. This yields a mortality rate, M [M i T 4) of: 


2n[JD —JDpeak + | 2 


1 
M= 2 | Gnn + Gyo Gyo Grin 0s 365 


7 (7) 


365 365 


where JD is the Julian day (1 Jan = 1, 31 Dec = 365), JDpeak is the day of the year 
when aboveground biomass is at its peak and Bin [M L °] is the minimum 
aboveground biomass, Gag [M L-* T~'] is the rate of aboveground biomass 
production per unit area, Gin [M L * T '] is the minimum growth rate, Gpeak 
[ML *T™']is the peak growth rate, ¢ is the phase shift (in days) between the peak 
growing season and the date of peak biomass’. The mortality rate is then integrated 
over a time step and multiplied by the ratio between aboveground and below- 
ground biomass. 

Organic-matter decay. Fresh organic matter enters the soil profile upon mortality. 
Dead biomass decays according to a linear decay model (0C;/ot = —k;C;) where 
C; [M L~?] is the concentration in pool i, and k; [T~'] is the decay coefficient 
in pool i (refs 12, 30). The decay coefficient of pool i is calculated as 
k;=(14+ ATox)kio expld/;] where ko is the decay coefficient at a reference 
temperature, 0; is a coefficient that describes the increase in decay coefficient as 
temperature increases, d [L] is the depth below the marsh surface and ju; (0.4 m) isa 
coefficient that describes the reduction in decomposition with depth as organic 
sediments are buried under anoxic conditions’*. Kirwan and Blum'* found the 
coefficient 0; to be about 0.25 °C’ for a mid-Atlantic S. alterniflora marsh. This 
result is qualitatively consistent with other observations of decay that are more 
rapid in marshes during warmer portions of the year, and with observations from 
terrestrial ecosystems (see references within ref. 16). In our simulations, we 
consider two pools of soil carbon: a quickly decaying pool of organic matter that 
reflects the decomposition of cellulose and amino acids (k;), and a slowly decaying 
pool that reflects the decomposition of more refractory materials such as lignins 
(k,). Experimental studies of decay in salt marshes are limited to durations less than 
five years, so this approach is necessarily guided by terrestrial models of carbon 
dynamics*'. Nevertheless, slow decay over timescales of decades to millennia is 
apparent in the salt marsh stratigraphic record”. 


(Bpeak — Bin) sin Aa) 
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Delayed phenology and reduced fitness associated 
with climate change in a wild hibernator 


Jeffrey E. Lane’?*, Loeske E. B. Kruuk', Anne Charmantier’, Jan O. Murie® & F. Stephen Dobson* 


The most commonly reported ecological effects of climate change are 
shifts in phenologies, in particular of warmer spring temperatures 
leading to earlier timing of key events’*. Among animals, however, 
these reports have been heavily biased towards avian phenologies, 
whereas we still know comparatively little about other seasonal 
adaptations, such as mammalian hibernation. Here we show a sig- 
nificant delay (0.47 days per year, over a 20-year period) in the 
hibernation emergence date of adult females in a wild population 
of Columbian ground squirrels in Alberta, Canada. This finding 
was related to the climatic conditions at our study location: owing to 
within-individual phenotypic plasticity, females emerged later 
during years of lower spring temperature and delayed snowmelt. 
Although there has not been a significant annual trend in spring 
temperature, the date of snowmelt has become progressively later 
owing to an increasing prevalence of late-season snowstorms. 
Importantly, years of later emergence were also associated with 
decreased individual fitness. There has consequently been a decline 
in mean fitness (that is, population growth rate) across the past two 
decades. Our results show that plastic responses to climate change 
may be driven by climatic trends other than increasing temperature, 
and may be associated with declines in individual fitness and, hence, 
population viability. 

The timings of key periodic events, or phenologies, in wild animal 
populations are assumed to have evolved to synchronize life cycles 
with seasonal availabilities of resources’. As temporal resource distri- 
butions are shifting rapidly owing to climate change’, the extent to 
which phenologies respond should therefore have important con- 
sequences for both the fitness of individuals and the viability of popu- 
lations”®. Phenological shifts are indeed the most often cited ecological 
responses to climate change’”, with estimates of the mean responses 
across groups of species ranging from advances of 0.23 days (ref. 7) to 
0.55 days (ref. 8) per year. Moreover, there is evidence that populations 
that have either not shown a phenological advance’, or have exhibited 
insufficient advances to remain synchronized with their primary food 
sources’®”’ are more likely to be in decline. 

Unlike many bird species that undertake extensive migrations to 
cope with seasonal variation in resource availability, most mammals 
are sedentary and must therefore rely on adaptations to cope in situ. 
Hibernation is one such adaptation that is widespread across 
mammalian orders’* and is assumed to have evolved in response to 
environmental resource shortages”. However, the environment in 
which hibernation has most often been studied, is the laboratory. 
Our understanding of the causes and consequences of natural vari- 
ation in hibernation expression thus lags considerably behind that of 
its underlying physiological and biochemical mechanisms. The pheno- 
logy of hibernation, particularly the date of emergence, is considered to 
have important consequences for individual fitness’* but the relative 
lack of data from wild populations currently limits our ability to 
adequately test this hypothesis and, by extension, make predictions 
as to how hibernating species will be affected by climate change. We 


report here an analysis of hibernation phenology in a wild population 
of Columbian ground squirrels (Urocitellus columbianus). 

Columbian ground squirrels are small (less than 1 kg), obligate 
hibernators resident in the Rocky Mountains of North America. The 
short growing seasons of plants in the alpine and subalpine regions 
that they inhabit require that they spend the majority of the year (8-9 
months) hibernating’’. Over the past 20 years, the mean date of emer- 
gence from hibernation of adult females in our study population in 
Alberta, Canada, has become later by 0.47 + 0.15 days (+ s.e.m.) per 
year (Fi13 = 9.68, r = 0.35, P = 0.006; general linear model; Fig. 1a). 
To investigate the environmental factors contributing to this delay, we 
quantified the phenotypic responses of individuals to variation in two 
weather variables (date of snowmelt and spring temperature; see 
Methods for definitions) previously shown to influence emergence 
date in hibernating mammals®”*, as well as the annual trends in these 
predictors. 

Individuals responded plastically to both weather variables. Female 
ground squirrels emerged 0.07 + 0.03 days (+ s.e.m.) later with each 
1-day delay in snowmelt, and 1.51 + 0.47 days (+ s.e.m.) later with 
each 1-°C decrease in temperature (Table 1, Fig. 1b and Supplemen- 
tary Fig. 1). There was also significant variation between individuals 
in their average emergence date (partly because emergence date is 
heritable in this population'’), but not in individual plasticity in res- 
ponse to either climatic variable (that is, a significant difference in 
intercept, but not slopes of individual reaction norms'’; Table 1). 

Date of snowmelt has been delayed by 2.83 + 0.88 days (+ s.e.m.) 
per year over the course of the study (Fiig = 10.38, r = 0.37, 
P = 0.005; general linear model; Fig. 1c), but there has not been a 
significant trend in spring temperature (—0.10 + 0.05°C (= s.e.m.) 
per year; F)1g = 3.00, r = 0.10, P = 0.101; general linear model; 
Fig. 1c). The delay in snowmelt has been caused by an increasing 
likelihood of late-season snowstorms ( val = 8.97, P = 0.003; generalized 
linear model). During the first decade of the study, there was only 1 year 
that experienced a substantial snowfall event (that is, =5cm of snow 
deposited over >1 day) after the middle of April, whereas in the most 
recent decade 7 years experienced such late snowstorms. This result 
follows consensus projections from current climate models of elevated 
atmospheric moisture flux and, over most of North America, increased 
precipitation. Specifically, an overall increase in both total winter pre- 
cipitation and the frequency of heavy precipitation events is projected”. 

Adult females are engaged in reproduction for approximately half of 
the 3-4-month active season (that is, 51 days: 24 days of gestation and 
27 days of lactation”®), after which females and offspring must accu- 
mulate sufficient fat resources to endure the ensuing hibernation 
period. Therefore, we predicted that delayed emergence dates would 
have fitness costs for adult females. We calculated an annual contri- 
bution to individual fitness (hereafter referred to as ‘annual fitness’) 
from both individual over-winter survival (S) and reproductive success 
(R) as S + 0.5 X R (ref. 21) (analyses of S and R separately are provided 
in Supplementary Tables 1 and 2, and Supplementary Fig. 2). S was 
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scored as 0 or 1 depending on whether a female with a known emer- 
gence date in year, survived to the spring of year,,, (average across all 
years: 77.1 + 3.3 % (+ s.e.m.)). Adult females are philopatric” and the 
study population is fully censused; we were thus confident that all 
females surviving the winter were captured the following spring. 
R was calculated as the number of offspring that a female produced 
in a given year that survived over the winter. Average litter size is 
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Figure 1 | Variation in emergence date from hibernation and two 
environmental predictors of emergence (spring temperature and date of 
snowmelt) for adult female Columbian ground squirrels during the past 20 
years. a, Annual trend in mean emergence date (byear = 0.47, P = 0.006). b, The 
influence of date of snowmelt (bw (the within-subject effect) = 0.07, P = 0.031 
(from within-subject mean centred analysis)) and spring temperature 

(bw = —1.51, P = 0.003 (from within-subject mean centred analysis)) on 
emergence date. c, annual variation in the date of snowmelt (black circles; mean 
centred; by.a. = 2.83, P = 0.005) and spring temperature (open circles; mean 
centred; byear = —0.10°C, P = 0.101). Regression lines (with 95% confidence 
intervals) are indicated for all significant relationships in a and c. Data are 
shown as mean + s.e.m. 


approximately 3 (ref. 23) and the average over-winter survival rate 
of juveniles is 30-35%”; average R across all females and years was 
therefore 0.91 + 0.05 (+ s.e.m.) surviving offspring. Individuals of 
neither sex disperse as juveniles”, and all animals surviving their first 
winter were therefore captured as yearlings the following spring. 

We considered effects of variation both between and within years 
when investigating the relationship between annual fitness and emer- 
gence date. Years with late average emergence dates were associated 
with lower annual fitness of females (Table 2 and Fig. 2a), but there was 
no evidence of further effects of variation within years (Table 2; that is, 
within a given year, there was not a significant difference in the fitness 
of early versus late emergers). As average emergence date has become 
later over the past 2 decades, there has been an associated decrease in 
mean annual fitness within the population (byear = —32.63, Fii6 = 
7.58, P = 0.014; general linear model (where bya, is the linear 
coefficient term for year)). Although the decrease has slowed down 
recently (Drea = 8.00 X 10 °, F, 16 = 6.71, P = 0.020; general linear 
model, where Die is the quadratic coefficient term for year), partly 
owing to two recent years in which emergence was relatively early 
(2004 and 2010; Fig. 1a), mean fitness has also become more volatile 
(Fig. 2b). During the first decade of the study, only 1 of the 10 years 
(10%) exhibited a mean fitness value of less than 1 (that is, population 
decrease), whereas 4 out of 9 years (44%) exhibited such values in the 
most recent decade. 

Declines in population viability resulting from insufficient pheno- 
logical advances can theoretically be mitigated by adaptive phenotypic 
plasticity*® or microevolution® serving to resynchronize populations 
with their food resources. However, the phenological shift in 
Columbian ground squirrels is in the very direction that seems to be 
detrimental to fitness (Fig. 2a). Heavy snow cover prevents access to 
the squirrels’ vegetative food sources. Therefore, squirrels are unable to 
resume above-ground activity earlier. However, delayed emergence 
presumably challenges their ability to subsequently accumulate the 
necessary fat resources to survive ensuing hibernation. This challenge 
could be from a shortening of the active season and/or from the squir- 
rels attempting to fatten during a less energetically favourable period 
(for example, after vegetative senescence). Regardless of the mech- 
anism, the documented association between emergence date and indi- 
vidual fitness suggests that, should the projections of increased winter 


Table 1 | Effect of date of snowmelt and spring temperature on hibernation emergence date of female Columbian ground squirrels. 


Effects Model terms Coefficient (s.e.m.) 7? (df) P value 

Fixed effects Intercept 118.46 (0.77) 
Between-subject effect (snowmelt) 0.06 (0.04) 2.71 (1) 0.100 
Within-subject effect (snowmelt) 0.07 (0.03) 4.65 (1) 0.031 
Between-subject effect (spring temperature) —2.77 (0.76) 12.40 (1) <0.001 
Within-subject effect (spring temperature) —1.51 (0.47) 8.66 (1) 0.003 
Age Range: —4.93 (1.81; age = 9) to 0.95 (5.04; age = 11) 24.56 (9) 0.004 

Random effects Year 6.38 59.45 (1) <0.001 
Random intercept 8.28 34.62 (3) <0.001 
Random slope (snowmelt) 1.12 x 10-3 0.77 (3) 0.856 
Random slope (spring temperature) 0.05 0.35 (3) 0.950 
Residual error 21.08 

Model outputs are from a random regression model with within-subject mean centring (n = 512 records from 150 females, across 20 years). The influences of date of snowmelt and spring temperature are 


partitioned into between- and within-subject effects. Age, in years (as a 10-level categorical factor), is included as a fixed effect to control for age-related influences on emergence date and we report the maximum 
and minimum of the 10 age coefficients. The random effect structure estimates the variance in both the intercept and slopes of individuals in response to the two environmental variables (see Methods for details). P 


values are based on likelihood ratio tests. df, degrees of freedom. 


27 SEPTEMBER 2012 | VOL 489 | NATURE | 555 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


Table 2 | Influence of variation in hibernation emergence date on annual fitness of female Columbian ground squirrels. 


Effects Model term Coefficient 95% Highest posterior density intervals 
Fixed effects Intercept 1.23 0.88 to 1.52 

Between-year gradient —0.14 —0.27 to -—0.03 

Within-year gradient —0.03 —0.08 to 0.02 

Density —9.68 x 10° ~2.04 x 10°* to 1.90 x 10°° 

Age Range: —0.76 (age = 10) to 0.20 (age = 4) 1.01 to —0.15 (age = 10); 0.09 to 0.43 (age = 4) 
Random effects Identity 0.06 

Year 0.09 


Standardized linear (between-years and within-years) gradients, based on annual fitness, for hibernation emergence date of adult female Columbian ground squirrels. Age in years (as a 10-level categorical factor) 
is included as a fixed effect to control for age-related influences on fitness and we report the maximum and minimum of the 10 age coefficients. Density (number of adult females in the population) is included as a 
continuous fixed effect to control for density-dependent influences. Analyses are based on a sample size of n = 459 records, collected from 145 females for which we had both an emergence date phenotype and 
fitness estimate. The 95% highest posterior density intervals (from 1,000 Markov chain Monte Carlo samples) are indicated. Individuals are represented in the data set multiple times, and multiple records were 
produced each year, therefore the variances associated with the random effects of identity and year are also included. Complete definitions for all terms are provided in the Methods. 


a 2.5 

B 2.0 + 

oO 

= =e 

= 154 

& 

S 4.0+-$4 Se a 

$ te 

E se ee 

a . 
0.0 . . . +. +. 1 
-1.5 -1.0 -0.5 0.0 0.5 1.0 1.5 


Standardized emergence date 


Figure 2 | Influence of mean emergence date from hibernation on, and the 
annual trend in, mean annual fitness of adult female Columbian ground 

squirrels over the past 20 years. a, The influence of standardized emergence 
date (standardized to a mean of zero and unit variance) on mean relative annual 
fitness (individual values divided by the population mean) (from mean-centred 
selection analysis, 6, (between-individual selection gradient) = —0.14; highest 


precipitation hold true, continued phenological delays will lead to 
diminished and more volatile population dynamics. 

A recent study has shown that earlier snowmelts led to advances in 
emergence date from hibernation and concordant increases in over- 
winter survival of yellow-bellied marmots (Marmota flaviventris)°. 
Thus, although the direction of the trend is directly opposite to what 
we observed in Columbian ground squirrels, the links between the 
environmental predictor, emergence phenology and fitness are con- 
sistent. This is important because the overwhelming majority of 
research to date on the effects of climate change in natural systems 
has focused on increasing temperatures (for example, a previous study” 
summarized over 25,000 climate-change-induced phenological shifts 
and all of these were in response to temperature). Spring temperature 
is certainly the most ecologically relevant cue for many species, but 
over many areas of the globe its influence will be moderated by the date 
of snowmelt. Historically, these two cues should have been correlated 
as warmer springs led to earlier melting of the snow pack. However, 
over the past 20 years, this correlation has been weak at our study site 
(t1g = —1.85, r= —0.40, P = 0.081; Pearson correlation) as the date of 
snowmelt is now heavily influenced by an increasing prevalence of 
late-season snowstorms. The results presented here highlight the 
extent to which aspects of climate change other than increasing tem- 
perature may affect natural populations. 


METHODS SUMMARY 


We documented dates of emergence from hibernation and annual fitness of all 
adult females (n = 512 records from 150 females) within a fully censused popu- 
lation of Columbian ground squirrels in Alberta, Canada (50° N, 114° W and 
1,500 m above sea level) from 1992 to 2011. Unique alphanumeric eartags enabled 
individuals to be identified, and maternity assignment protocols are as described 
previously’’. The date of snowmelt and spring temperature were recorded at a 
nearby weather station and a complete explanation of the treatment of climatic 
data are provided in the Methods. We used general linear models to quantify 
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posterior density interval, —0.27 - —0.03). b, The annual trend 

(Byear = —32.63, P = 0.0145 byear” = 8.00 X 107°, P = 0.020) in mean annual 
fitness. Regression lines (with 95% confidence intervals) are indicated for 
both significant relationships and the threshold between population growth 
and decline (mean annual fitness = 1.0) is indicated in b. Data are shown as 
mean + s.e.m. 


annual trends in mean emergence date, the two environmental variables and mean 
fitness (n = 20 years, except for analyses involving fitness as 2011 data are not yet 
available). We also fit a generalized linear model with a binomial error structure to 
examine whether late-season snowstorms have become more likely in recent years. 
Plasticity of individuals was quantified using a random regression model with 
within-subject mean centering'*”’. We used a mean-centred mixed-effects model, 
implemented in a traditional linear selection analysis’””*, to examine the influence 
of hibernation phenology on annual fitness both within, and across, years. 
Nonlinear and correlational selection gradients are presented in Supplementary 
Table 3. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Study system. We used previously established protocols'” to document dates of 
hibernation emergence of adult female Columbian ground squirrels (=2 years of 
age). In brief, we began conducting daily surveys from observation towers before 
the emergence of the first individuals. As they emerged, squirrels were caught in 
live traps and identified with alphanumeric eartags. Almost all individuals 
(n = 136) were originally handled as juveniles and their ages were known with 
certainty. Adults in the first year of the study (1992; n = 4) and any immigrating 
animals (m = 10) received eartags on first capture and their ages were estimated 
either as yearlings or 2 years and older, based on body size and reproductive 
condition. 

Weather variables. The date of snowmelt for each year was determined from data 
collected at a nearby weather station in Okotoks, Alberta (Environment Canada, 
http://climate.weatheroffice.ec.gc.ca; 50° N, 114° W, and 1,081 m above sea level; 
approximately 40 km from the study site). Patterns of snowmelt in this area can be 
extremely variable owing to adiabatic warming of winds from the Pacific Ocean” 
and frequent late-season snowstorms. We therefore chose the first day of the year 
with 0cm of snow on the ground after continuous snow cover. However, if a 
substantial snowfall event (that is, =5 cm deposited over >1 day) occurred after 
this date, we selected the first day with 0 cm of snow after this event. The restriction 
to >1 day was chosen so that only snowfall events providing relatively consistent 
ground cover (as opposed to short-duration snow flurries) were included. A 
relatively moderate definition of =5 cm was chosen because the weather station, 
although relatively close in distance to the field site, is >400 m lower in elevation. 
Snowfall at the field site is therefore expected to be more severe. In 2 separate years 
(2002 and 2004) a substantial snowfall event occurred following the emergence of 
all females. The dates of snowmelt after these events were considered in the 
analyses of annual trends in weather variables. However, as they could not influ- 
ence ground squirrel phenology, the dates of snowmelt before these events were 
used in the plasticity analyses. 

Mean spring ambient temperatures (T,) were calculated based on daily tem- 
perature recordings taken at the Okotoks station and a sliding window approach 
was used to determine the relevant period over which to calculate average tem- 
perature. We varied the window from 2 to 30 days and considered all possible 
windows from 1 January to 15 June. We selected the window (Julian days: 102-123 
(12 April to 3 May in non-leap years)) that provided the highest r° with the mean 
annual emergence date (7° = 0.54). We used a Pearson’s product-moment cor- 
relation test to estimate the correlation between the two weather variables. 
Plasticity analyses. We used a random regression model to evaluate the influence 
of the two weather variables on the emergence dates of females'’. Because we 
wished to quantify the extent of within-individual plastic responses to environ- 
mental variation, we used a within-subject mean centring approach that decom- 
poses the environmental effects into those associated with the average 
environment experienced by an individual over its lifetime (a ‘between-individual’ 
effect), versus deviations of the environment in a given year from the individual’s 
lifetime average (the ‘within-individual’ plastic effect)”’. 

Emergence date (yj) for each individual i in year j was modelled as a continuous 
response to variation in the two weather variables, T, and snow: 


Ij = bo + bgTa,. + bw (Ta, —Ta,-) + bgnOW;. + bw (snowy —SHOW;.) +age 
+ [year] + [IDj] + [ID;. (snows —Snow; ) | 4 


[1D:.(Ta, Ta.) | + 25 


where bo represents the intercept and, as the weather variables were standardized 
to zero mean, is equivalent to the expected emergence date in the average environ- 
ment. The between-subject effect (bg) is estimated as the slope of y,, on the mean 
value of each weather variable for each individual, i. The within-subject (bw) effect 
is estimated as the slope of yj on the annual deviation of the weather variable 
from the subject mean. The random effects are shown in square brackets: [ID;] 
quantifies the variation in within-subject intercepts, and [IDj.(Ts, —T,,)] and 
[ID;. (snow; —Snow;) | quantify the variation in the within-subject slopes for 


the effects of the two weather variables (that is, the reaction norms'®). Year [year] 
was included in order to model variation across years that is not explained by the 
weather variables and to account for multiple measures per year. Age in years was 
included as a 10-level fixed effect and e; represents the residual error. Significance 
of model terms was assessed with likelihood ratio tests (—2 difference in log- 
likelihood between hierarchical models, tested against a chi-square distribution 
with the number of degrees of freedom that corresponded to the difference in the 
number of terms estimated). 

Annual trends. We used general linear models implemented in R (ver. 2.10.0; 
R Core Development Team) to determine whether there was a significant annual 
trend (year fitted as a continuous fixed effect) in mean emergence date or in either 
of the weather variables. We also fit a generalized linear model with a binomial 
error structure to examine whether late-season snowstorms have become more 
likely in recent years. The female that emerged the earliest over the 20 years did so 
on 9 April. Therefore, whether a snowstorm, which deposited at least 5cm of 
snow, was observed following 9 April (scored as 0 or 1) was entered as the depend- 
ent variable, with year fitted as a continuous fixed effect. 

Fitness implications of variation in emergence date. We calculated annual 
contributions to lifetime fitness (which, for simplicity, we refer to as a measure 
of ‘annual fitness’) for individuals as $ + 0.5 X R, following a previous study”’. $ 
represents over-winter survival and R represents annual reproductive success 
(defined in the main text). Prior to the analysis, emergence dates were standar- 
dized to a mean of zero and unit variance, and relative fitness was calculated by 
dividing by the overall mean’*. Because we wished to test whether any association 
between annual fitness and emergence date was driven by between- versus within- 
year variation, for each individual we considered effects of the annual mean 
emergence date (7,) and the deviation of that individual from the annual mean 


Vij —Yj)- 
= Bo + Ba¥; + Bw (9-7) +age+ density + [year] + [ID] +e 


In this model, «, represents relative annual fitness (divided by the overall 
population mean) of individual i at time j. fo is the intercept, Py represents the 
influence of the annual mean emergence date that year and fw represents the 
within-year variation. Age in years was included as a 10-level fixed effect. 
Population density (estimated as the number of adult females) varied substantially 
across the study period (range: 4-56) and was therefore included as a fixed effect to 
control for potential density-dependent effects on fitness. Explanations and results 
of quadratic and correlational analyses are included in Supplementary Table 3. 
Highest posterior density intervals were generated from 1,000 Markov chain 
Monte Carlo samples and are reported instead of standard errors and P values, 
as we cannot assume normality’. The random effects of identity [ID;] and 
[year] were fitted to account for multiple records per individual and per year, 
respectively. 

We used a general linear model to determine whether there has been a signifi- 
cant trend across years in the average annual fitness within the population and 
whether this trend has changed over time (by fitting both the linear and quadratic 
coefficients for year, respectively). Female survival rate (S) was calculated as the 
percentage of females surviving over the winter and annual reproductive success 
(R) was calculated as the mean number of pups surviving over the winter per 
female. Mean annual fitness across the population was then calculated as 
S$+0.5 x R Note that this also provides an approximation of population growth 
rate*’. 


29. Whiteman, C. D. Mountain Meteorology: Fundamentals and Applications. (Oxford 
University Press, 2000). 

30. Bates, D. M., Maechler, M. & Dai, B. Ime4: Linear mixed-effects models using S4 
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variation in demography of a house sparrow metapopulation. J. Anim. Ecol. 68, 
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Evidence for dietary change but not landscape use in 
South African early hominins 


Vincent Balter’, José Braga’, Philippe Télouk' & J. Francis Thackeray? 


The dichotomy between early Homo and Paranthropus is justified 
partly on morphology’. In terms of diet, it has been suggested that 
early Homo was a generalist but that Paranthropus was a specialist’. 
However, this model is challenged and the issue of the resources 
used by Australopithecus, the presumed common ancestor, is still 
unclear. Laser ablation profiles of strontium/calcium, barium/ 
calcium and strontium isotope ratios in tooth enamel are a means 
to decipher intra-individual diet and habitat changes. Here we show 
that the home range area was of similar size for species of the three 
hominin genera but that the dietary breadth was much higher in 
Australopithecus africanus than in Paranthropus robustus and early 
Homo. Wealso confirm that P. robustus relied more on plant-based 
foodstuffs than early Homo. A South African scenario is emerging 
in which the broad ecological niche of Australopithecus became 
split, and was then occupied by Paranthropusand early Homo, both 
consuming a lower diversity of foods than Australopithecus. 

The dietary specialization of Paranthropus is thought to have con- 
tributed to its demise because it was unable to adapt to the changing 
environmental conditions that took place approximately 1 million 
years ago’. From an ecological point of view, a small dietary breadth 
is generally correlated with narrow habitat preferences’. The evalu- 
ation of the size of the home range area, in addition to the estimation of 
the dietary breadth, would provide a detailed picture of resource use by 
the hominins. Using tooth enamel, the present work investigates the 
dietary breadth and the home range area of South African hominins 
with strontium/calcium (Sr/Ca) and barium/calcium (Ba/Ca) ratios, 
and strontium isotope (*’Sr/*°Sr) compositions. Dietary Sr/Ca and 
Ba/Ca proxies are based on the biological discrimination of Sr and 
Ba relative to Ca (that is, Ca biopurification), resulting in predictably 
lower Sr/Ca and Ba/Ca in a consumer, relative to its diet*. In bones, 
Sr/Ca and Ba/Ca decrease concomitantly up trophic chains*, but in 
enamel, and for a reason that is not yet fully understood, only low 
Ba/Ca ratios are characteristic in carnivores. Low Sr/Ca ratios are 
representative of browsers and carnivores’, whereas high Sr/Ca or 
Ba/Ca ratios in enamel are characteristic in grazers’. In contrast, the 
*7Sr/*°Sr of a consumer is free of any trophic process®. It only reflects 
the °’Sr/*°Sr composition of the bedrock (substrate) on which the 
consumer fed because the abundance of radiogenic *’Sr is determined 
by normalization of the stable isotope variations relative to a reference 
*°sr/**Sr ratio of 0.1194. 

The Sr/Ca, Ba/Ca and ®’Sr/*°Sr ratios are assessed using laser ablation 
profiles in permanent molars from the dentine enamel junction to outer 
enamel following the direction of the enamel growth prism’°. The 
specimens include Australopithecus africanus (n = 4), Paranthropus 
robustus (n=7) and early Homo (n=3) from Sterkfontein, 
Swartkrans and Kromdraai B, South Africa (Supplementary Table 1). 
For comparison, we also measured the Sr/Ca, Ba/Ca and 876r/°°Sr ratios 
in tooth enamel of grazers and browsers (n = 3) from Sterkfontein and 
Swartkrans. In most cases, several profiles were analysed for a given 
tooth to assess intra-tooth variability. The diagenetic effects on Sr/Ca, 


Ba/Ca and *’Sr/*°Sr of the Sterkfontein Valley fossil material are 
negligible (Supplementary Information). Several lines of evidence 
support this conclusion. First, if a sizeable pool of biological Sr is 
exchanged with a diagenetic component, this would affect the Sr iso- 
topic composition and the samples would fall on a Sr/Ca versus 
*’Sr/*°Sr line representing the mixing between the biological and the 
diagenetic Sr components. Taking the whole data set into account, it is 
evident that Sr/Ca and °’Sr/*°Sr ratios are not correlated (R? = 0.008, 
P= 0.260). As well as Ca, Sr and Ba, we also measured a suite of other 
trace elements (Mn, Zn, Rb, Y, La, Sm, Yb, Th and U), which are 
present at the sub-p.p.b. level in modern enamel, and which are taken 
up easily post mortem”®. No co-variation between Ca, Sr or Ba and the 
diagenetic proxies was found for any of the profiles (Supplementary 
Information). Last, the concentrations of the diagenetic proxies are 
very low, typically at the p.p.b. level, which is characteristic of pristine 
enamel and in accordance with previous results obtained using solu- 
tion inductively coupled plasma mass spectrometry (ICPMS)"". 

The *’Sr/*°Sr ratio ranges from 0.7197 to 0.7412 for the hominins 
and the bovids, with a mean value of 0.7309 + 0.0048 (+ 1 s.d.) (Fig. 1 
and Supplementary Table 1). Recently, a study’? reported *’Sr/*°Sr 
ratios in the enamel of hominins and animals from Sterkfontein and 
Swartkrans (min = 0.7197, max = 0.7454, mean = 0.7305 + 0.0054 
(+1.d.)). The °’Sr/*°Sr values of the two sets of data for hominins 
are indistinguishable (Fig. 1; Student’s t-test, P = 0.240). However, the 
data sets differ in the variability of the intra-tooth 8761/8°Sr ratio, which 
is provided by the coefficient of variation associated with the mean 
*’Sr/*°Sr value for a given tooth. In this study, the mean *’Sr/*°Sr 
values and their associated coefficients of variation are normally 
distributed (Shapiro-Wilk test, P = 0.306 and P= 0.107, for mean 
and standard deviation, respectively) and are not correlated 
(R? = 0.007, P = 0.659). In contrast, in the study mentioned above’, 
the mean 8’Sr/*°Sr values are normally distributed (Shapiro-Wilk test, 
P= 0.111), whereas the coefficient of variation values are log-normally 
distributed (Shapiro—Wilk test, P= 0.972) and the mean 876r/8°Sr 
values are positively correlated with their log-normalized coefficient 
of variation values (R* = 0.546, P< 10 *). The two data sets were 
obtained using different laser-sampling strategies, and this probably 
explains the observed patterns of intra-tooth *’Sr/*°Sr variability. 

As in the majority of studies using laser-ablation ICPMS on tooth 
samples’*'*""’, we carried out laser sampling on flat enamel surfaces. 
To do this, we used naturally broken teeth or teeth that had already 
been cross-sectioned mechanically. However, in the study mentioned 
previously’’, sampling was carried out using laser scans on the external 
surface of the enamel of whole teeth, which is a curved surface. This 
would have inevitably resulted in the sampling of laser-unfocused 
areas, an analytical bias that is known to produce variations of the size 
of the ablated particles and instrumental isotopic fractionation during 
ionization, leading to artificially scattered isotopic ratios'*. The 
sampling strategy that was used’* could explain why the smallest teeth 
were found to exhibit the highest intra-tooth *’Sr/*°Sr variability. 


1Laboratoire de Géologie de Lyon, UMR 5276, CNRS, Ecole Normale Supérieure de Lyon, 15 parvis René Descartes BP 7000, 69342 Lyon Cedex 07, France. *Computer-assisted Palaeoanthropology Team, 
UMR 5288, CNRS, Université de Toulouse Paul Sabatier, 37 Allées Jules Guesde, 31000 Toulouse, France. *Institute for Human Evolution, University of the Witwatersrand, PO Wits, Johannesburg 2050, 
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Figure 1 | °’Sr/*°Sr ratios of hominin and bovid enamel. The shaded area 
contains data from a previous study’. Error bars are 2-sigma standard 
deviations of the mean. 


Finally, we do not find statistically significant differences of the 
876/°°Sr ratio between hominin groups (Kruskal-Wallis, P = 0.266), 
or between the hominin group as a whole and the fauna (Kruskal- 
Wallis, P = 0.259). Thus, the South African hominins do not show 
clear systematic variation of the enamel 87¢1/°°Sr ratio, and we con- 
clude, on the basis of present results, that the home range areas of 
the three hominins were of similar size and composed of identical 
geological substrates. 

The Sr/Ca and Ba/Ca ratios tend to decrease from the enamel 
dentine junction to the outer enamel, in accordance with previous 
findings® (Supplementary Data). These trends have been interpreted 
to result from a positive gradient of mineralization, which favours Ca 
against Sr and Ba during the enamel secretory and maturation phases. 
In contrast to the Sr isotopic data, the elemental Sr/Ca and Ba/Ca data 
in the enamel of South African hominins display differences between 
the three genera, with a certain degree of overlap (Figs 2 and 3). The 
Sr/Ca ratio of A. africanus is significantly higher than that obtained for 
other hominin taxa, and the Ba/Ca ratio of early Homo is significantly 
lower than that obtained for other hominin taxa (Supplementary 
Table 2). The P. robustus specimens are characterized by Sr/Ca and 
Ba/Ca values that are intermediate between those of A. africanus 
and early Homo. Compared to the fauna, and taking into account 
both the Sr/Ca and Ba/Ca ratios, early Homo and P. robustus are 
indistinguishable from carnivores and browsers, respectively (Sup- 
plementary Table 2). With regard to A. africanus, the Sr/Ca and Ba/Ca 
ratios are indistinguishable from grazers and browsers, respectively 
(Supplementary Table 3). A possible interpretation is that the diets 
of early Homo and P. robustus were typical of carnivores and browsers, 
respectively, whereas the diet of A. africanus was more complex. 

Taking the intra-tooth variability of Sr/Ca and Ba/Ca into account 
gives a more complete picture of the dietary habits of hominins. The 
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Figure 2 | Sr/Ca ratios of hominin and bovid enamel. For both ratios, error 
bars are 2-sigma standard deviations of the mean, and the shaded areas contain 
data from a previous study’. 


variability of intra-tooth Sr/Ca ratios provided by the coefficient of 
variation of the mean Sr/Ca value for a given sample is higher for A. 
africanus (29.5 + 11.3% (+ 1s.d.)) than for the other hominin groups 
(P. robustus, 13.8 = 7.6%; early Homo, 8.6 + 2.7%; Supplementary 
Table 3). This is also the case for the variability of the intra-tooth 
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Figure 3 | Ba/Ca ratios of hominin and bovid enamel. For both ratios, error 
bars are 2-sigma standard deviation of the mean, and the shaded areas contain 
data from a previous study”’. 
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Ba/Ca ratio (A. africanus, 47.7 + 25.6%; P. robustus, 26.0 + 12.5%; 
early Homo, 23.8 + 8.0%; Supplementary Table 3). 

The diet of A. africanus has a high Sr/Ca and a low Ba/Ca ratio, but a 
monotonous consumption ofa Sr-enriched and Ba-depleted diet would 
not have produced the observed high intra-tooth Sr/Ca and Ba/Ca 
variability of A. africanus. It is more likely that the diet of A. africanus 
resulted from the combination of two distinct dietary sources. An asso- 
ciation of woody plants represented by leaves and fruits of trees and 
bushes, which are Sr-enriched and Ba-depleted*, and meat, which is 
both Sr- and Ba-depleted, could explain the enamel Sr/Ca and Ba/Ca 
patterns of A. africanus. Woody plants and meat products could have 
been consumed alternately by A. africanus on a seasonal scale. This is 
regarded as the most probable dietary association for the genus 
Australopithecus because it is the only way to reconcile conflicting 
results based on tooth morphology”, dental microwear* and carbon 
isotope ratios*’. This scenario is compatible with the idea of fallback 
resource use, by which it is proposed that food items of poor nutritional 
quality become an increasing food component as preferred foods 
become rare*’””. However, at this stage, it is not possible to state whether 
the fallback food was the woody plant or the meat component. 

The diet of P. robustus seems to have been less variable than that of 
A. africanus and was mainly based on woody plant foodstuffs. This is 
consistent with the measured average Sr/Ca and Ba/Ca ratios, which are 
indistinguishable from browsers, the reduced intra-tooth Sr/Ca and 
Ba/Ca ratio variability, and with dental microwear and tooth morpho- 
logy data***. Similar to P. robustus, the diet of early Homo was less 
variable than that of A. africanus, but contrary to the diet of P. robustus, 
it was based more on meat products. Our results support the idea that 
the degree by which woody plants and underground storage organs”! 
were consumed by A. africanus was reduced in the Homo lineage, 
whereas the more specialized masticatory apparatus of the robust 
australopithecines enabled them to have subsisted mostly on this type 
of food. However, this constitutes a South African scenario for which 
the diet of the Homo-like australopith A. sediba remains to be tested”*. 


METHODS SUMMARY 


The analysed tooth samples were housed at the Transvaal Museum (Ditsong 
National Museum of Natural History) in Pretoria, South Africa. The hominid 
teeth that were selected were naturally broken or already cut for microstructural 
studies purposes. This strategy of sampling enables chemical variations to be 
measured along profiles from the enamel dentin junction to the outer enamel. 
Sintered standard SRM-1400 (‘Bone Ash’) was used as a bracketing reference, both 
for elemental and isotopic measurements, to correct for instrumental biases. A 
quadrupole-ICPMS (ThermoElement X7) was used for the measurement of trace 
elements concentrations. Selected monitored elements were Ca, Mn, Zn, Rb, Sr, Y, 
Ba, La, Sm, Yb, Th and U. The ®’Sr/*°Sr ratio was measured on a Nu-HR (Nu- 
instrument) multicollector-ICPMS. The laser is a 157-nm F,/He excimer laser 
LPF202 of Lambda Physiks’. Contrary to our previous study’, we did not measure 
trace element concentrations and isotopic ratios on the same laser spot, because 
the Sr concentrations were too low in some of the samples, resulting in poor signal 
intensity on the multicollector-ICPMS. The trace elements and the isotopic ratios 
were therefore measured separately in two different sessions. The main opera- 
tional conditions are summarized in Supplementary Table 1. The standard bone 
ash SRM-1400, which is certified to contain 250 p.p.m. of Sr, produces typical 
signals of “Sr of approximately 5 10° counts per second (c.p.s.) on the 
quadrupole-ICPMS and approximately 700mV on the multicollector-ICPMS. 
We obtain an average Sr/Ca X 10° value of 0.668 + 0.042 (+ 2s.d.) and an average 
Ba/Ca X 10° value of 0.578 + 0.054 (+ 2.d.) for the standard SRM-1400 (n = 16). 
These numbers are close to the certified values of 0.652 and 0.629, respectively”. 
We obtain an average 876r/8°Sr value of 0.713068 + 0.000728 (+ 2s.d.) for the 
standard SRM-1400 (n=17), which is close to a thermal ionization mass 
spectrometry (TIMS) value of 0.713104 + 0.000019 (+ 2s.d.) that was reported 
previously*®. The method is described fully in the Supplementary Information. 
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Skin shedding and tissue regeneration in African 


spiny mice (Acomys) 


Ashley W. Seifert!??, Stephen G. Kiama’, Megan G. Seifert’?, Jacob R. Goheen**, Todd M. Palmer’? & Malcolm Maden! 


Evolutionary modification has produced a spectrum of animal 
defence traits to escape predation, including the ability to autotomize 
body parts to elude capture’”. After autotomy, the missing part is 
either replaced through regeneration (for example, in urodeles, 
lizards, arthropods and crustaceans) or permanently lost (such as in 
mammals). Although most autotomy involves the loss of appendages 
(legs, chelipeds, antennae or tails, for example), skin autotomy can 
occur in certain taxa of scincid and gekkonid lizards’. Here we report 
the first demonstration of skin autotomy in Mammalia (African spiny 
mice, Acomys). Mechanical testing showed a propensity for skin to 
tear under very low tension and the absence of a fracture plane. After 
skin loss, rapid wound contraction was followed by hair follicle regen- 
eration in dorsal skin wounds. Notably, we found that regenerative 
capacity in Acomys was extended to ear holes, where the mice 
exhibited complete regeneration of hair follicles, sebaceous glands, 
dermis and cartilage. Salamanders capable of limb regeneration form 
a blastema (a mass of lineage-restricted progenitor cells*) after limb 
loss, and our findings suggest that ear tissue regeneration in Acomys 
may proceed through the assembly of a similar structure. This study 
underscores the importance of investigating regenerative phenomena 
outside of conventional model organisms, and suggests that 
mammals may retain a higher capacity for regeneration than was 
previously believed. As re-emergent interest in regenerative medicine 
seeks to isolate molecular pathways controlling tissue regeneration in 
mammals, Acomys may prove useful in identifying mechanisms to 
promote regeneration in lieu of fibrosis and scarring. 

Among mammals, autotomy seems to have evolved several times, 
but is taxonomically sparse. Documented autotomy is typically 
restricted to the tail and occurs through loss of the tail sheath (false 
autotomy) or through breakage across the vertebra (true autotomy)””. 
In addition to tail autotomy, casual reference has been made to 
mammalian species with weak or fragile skin, although whether these 
animals are capable of skin autotomy remains unknown. Thus, we first 
sought to investigate anecdotal evidence that two species of African 
spiny mouse (Acomys kempi and Acomys percivali) readily shed 
portions of their skin as a predator escape behaviour. 

To test the hypothesis that A. kempi and A. percivali are capable of 
skin autotomy, we live-trapped individuals on rocky outcroppings 
(kopjes) in central Kenya. In addition to guard hairs, species in the 
genus Acomys are notable for the presence of spine-like hairs on the 
dorsum (Fig. 1a, b). Handling both species in the field confirmed that 
vigorous movement often led to tearing of the skin. Tearing resulted in 
large open wounds or skin loss ranging from small pieces to areas 
approximating 60% of the total dorsal surface area (Fig. 1c). In 
addition to integumentary loss, both species exhibited autotomy of 
the tail sheath as previously reported for other Acomys species, and 
individuals were often captured with missing tails’. Among captive 
individuals, we observed severe skin wounds to heal quickly, and rapid 
regrowth of spiny hairs totally obscured the wounded area (Fig. 1d, e). 
Field-captured individuals showed similar healing and, in some cases, 


patterned hair follicles in anagen (growth phase) that seemed to have 
regenerated in wounded areas (Fig. 1f). 

To evaluate how Acomys skin tears so easily, we asked whether the 
mechanical properties of Acomys skin might underlie its observed 
weakness. On the basis of experiments investigating skin autotomy 
in geckos’, weak skin (that is, skin possessing uniform structural 
properties that fails or breaks under relatively low induced loading) 
can be differentiated from fragile skin (that is, skin possessing specific 
morphological characterizations such as a fracture plane that allows 
the outer layers to be released). To assess skin weakness, we compared 
mechanical properties of Acomys and Mus skin. During mechanical 
loading, Mus skin displayed elastic properties before breaking, whereas 
Acomys skin was brittle and began tearing shortly after load was 
applied (Fig. 2a). We derived stress-strain curves from dorsal skin 
to determine the mean tensile strength (o,,) and found that Mus 
skin was 20 times stronger than Acomys skin (2.3 + 0.19 MPa and 


Figure 1 | A. kempi and A. percivali exhibit skin autotomy and subsequent 
rapid healing. a, b, A. kempi (a) and A. percivali (b) possess stiff, spine-like 
hairs on the dorsum. c, A. kempi after loss of dorsal skin. d, e, Scab formation 
after full-thickness skin injury visible at D3 (d). The same wounds in d are no 
longer visible at D30 (e), and new spiny hairs cover the damaged area at D30 
(e). f, Healing wound in field-caught specimen showing new hair follicles 
within the wound bed. Scale bars, 1 cm. 
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0.11 + 0.03 MPa (mean + s.e.m.)) (Fig. 2a, b). Lastly, by calculating 
mean toughness (W), we found that nearly 77 times more energy was 
required to break Mus skin relative to Acomys skin (Fig. 2b). These 
results demonstrate that Acomys possess skin that tears (or breaks) 
easily in response to low applied tension and provide a mechanical 
basis for the weakness of their skin. 

To evaluate whether structural properties of Acomys skin contributed 
to its mechanical weakness, we examined cellular features of A. percivali 
skin and found that it was anatomically comparable to that of Mus and 
other rodents, albeit with much larger hair follicles (Fig. 2c, d). We found 
no evidence of a fracture plane, which is the mechanism of skin 
autonomy in geckos and skinks’. Examining elastin fibres, which 
enhance skin elasticity, we found all three species possessed a similar 
distribution and abundance of elastin in the dermis and beneath the 
panniculus carnosus (Supplementary Fig. la-f). We tested whether 
larger hair follicles in Acomys skin reduced the total dermal area occupied 
by connective tissue by examining the proportion of adnexa (that is, 
follicles and associated glands) within the dermis and found it was 
greater in A. percivali (55.61 + 4.28%) compared with M. musculus 
(43.65 + 4.62%) (t = 1.9, P = 0.043) (Fig. 2e, f). These findings suggest 
that although the basic tissue structure of Acomys skin is similar to Mus, 
the space occupied by adnexa in the dermis reduces the absolute con- 
nective tissue content, potentially contributing to the decreased elasticity 
and lower tensile strength when the skin is placed under tension’. The 
lack ofa fracture plane underscores this finding and supports an inherent 
structural difference underlying the observed weakness of Acomys skin. 

Given its inherent structural weakness and propensity to tear, we 
assessed the ability of Acomys to heal skin wounds using small (4mm) 
and large (1.5 cm), full-thickness excisional wounds. Scab formation 
and haemostasis was rapid in wounds of both sizes and the wound area 
in large wounds was reduced by 64 + 3.1% 24h after injury (Sup- 
plementary Fig. 2a). During scar-free healing in terrestrial salamanders’ 
and mammalian fetuses*, the wound bed is re-epithelialized within 
several days, whereas a 4-mm wound in adult rat skin takes between 


5 and 7 days to re-epithelialize’. In Acomys, we found that five out of 
six 4-mm wounds had completely re-epithelialized by day 3 after injury 
(D3), whereas Mus wounds failed to re-epithelialize this quickly 
(Fig. 2g, h). After re-epithelialization, loose-skinned mammals (such 
as rodents and rabbits, for example) rely primarily on contraction to 
heal their wounds’®. Similarly, we observed high contraction rates, 
which accounted for 95% of wound closure after 17 days (Supplemen- 
tary Fig. 2a—c). In contrast to scarring, in which collagen fibres organize 
into a dense network parallel to the epidermis, during scar-free healing 
collagen fibres assume a pattern similar to unwounded dermis"®. 
Examining the extracellular matrix (ECM) at D10, we observed 
scarring in Mus, whereas in Acomys, collagen fibres were less densely 
packed and contained a more porous structure (Fig. 2i, j). Using 
picrosirius red we found collagen typeI predominated the wound 
bed at D10 in Mus, whereas collagen type III was in greater abundance 
in Acomys (Fig. 2k, 1). This difference was even more pronounced in 
1.5-cm wounds (Supplementary Fig. 3a-b’). Together, these data show 
that rapid re-epithelialization and wound edge contraction greatly 
reduce the size of open skin tears in Acomys. Our findings, that wound 
ECM is deposited slowly, has a porous configuration and is dominated 
by type III collagen, suggest that this composition favours regeneration 
over fibrosis during skin repair in Acomys. 

To test the regenerative capacity of the wound environment we 
sampled large healing wounds for evidence of hair follicle neogenesis 
and dermal regeneration. In association with the more porous ECM, we 
observed folliculargenesis of normal pelage hairs and large spiny hairs in 
the wound bed between D21 and D28 and we could distinguish old, large 
follicles near the wound margins from newly regenerated follicles within 
the wound bed (Fig. 3a—-d and Supplementary Fig. 3c—e). New follicles 
seemed to regenerate throughout the uncontracted portion of the wound 
bed not just in the central region (Fig. 3c and Supplementary Fig. 3e), and 
we observed regenerating hair follicles in various stages of development 
(Fig. 3a-m and Supplementary Fig. 4a—c). A localized and highly pro- 
liferative population of epidermal cells drives hair follicle development 


Figure 2 | Acomys skin is weak, tears easily, and 
during repair develops a porous ECM rich in 
collagen type III. a, b, Stress—strain curves for M. 
musculus (n = 6), A. kempi (n = 5) and A. percivali 
(n = 5), depicted up to the failure strain (a) and for 
one individual (b) approximating the real mean 
tensile strength (o,,) and mean toughness (W) 
(represented as shaded area). c, d, Masson’s 
trichrome staining of unwounded back skin from 
M. musculus (c) and A. percivali (d). e, f, The 
proportion of adnexa (hair follicles and associated 
glands) in the dermis (yellow shading) of 

M. musculus (e) and A. percivali (f). g, Cytokeratin- 
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stained keratinocytes (yellow arrowhead) just 
beginning to migrate in small wounds at D3 in M. 
musculus. h, Completely re-epithelialized wounds 
in Acomys at D3. WM, wound margin. Insets show 
relative wound position of the pictured tissue. 

i-l, Picrosirius red staining of small wounds in M. 
musculus (i, k) and A. percivali (j, 1). Bifringence of 
picrosirius stain (k, 1) differentiates thick collagen 
type I fibres (red/orange) from thin collagen type 
III fibres (green). Collagen fibres in M. musculus 
are predominantly type I, densely packed and run 
parallel to the epidermis (k). Collagen fibres in A. 
percivali are more porous with a greater proportion 
of collagen type III (1). Scale bars, 100 jum. 
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and we observed a similar phenomenon during follicle regeneration 
(Fig. 3e and Supplementary Fig. 4a—-c). To investigate whether embry- 
onic signalling networks used during hair follicle development were used 
during hair follicle regeneration, we examined keratin 17 (KRT17), 
which is diffusely expressed within the epidermis during skin develop- 
ment and becomes progressively restricted to developing hair follicles". 
After re-epithelialization, KRT17 was highly enriched throughout the 
neoepidermis overlying the wound bed at D14, and as new hair follicles 
formed in the wound bed, KRT17 became restricted to follicular epithe- 
lium (Fig. 3f and Supplementary Fig. 5). During wound repair in Mus, 
we found KRT17 was also highly upregulated in the re-epithelialized 
epidermis at D14 (Supplementary Fig. 5), and although KRT17 localized 
to some basal keratinocytes in Mus epidermis at D21, these sites failed to 
aggregate into placodes or new hair follicles such that KRT17 was com- 
pletely absent from the new epidermis by D26 (Fig. 3f). The disappear- 
ance of KRT17 from basal keratinocytes in Mus, together with our 
observation of continued localization in new placodes and hair follicles 
in Acomys, suggests the underlying dermal signals required to induce 
placode formation in Mus are lacking. 

Although the precise signal for placode formation remains obscure, 
there is an absolute requirement for Wnt-signalling during normal 
follicle formation’*. Nuclear localization of LEF1 protein has been used 
as a readout of this inductive signalling’. We detected nuclear accu- 
mulation of LEF1 in regenerating epidermal placodes, condensing 
dermal fibroblasts beneath the hair germ, and in dermal papilla 
and matrix cells (Fig. 3g, h and Supplementary Fig. 6a). We also 
detected nuclear LEF1 staining at low levels in some non-placode basal 
keratinocytes, whereas we did not detect nuclear LEF1 in the epidermis 
during wound healing in Mus, suggesting epidermal Wnt-activation in 
Acomys may partially underlie our observation of hair follicle regen- 
eration (Supplementary Fig. 6b, c). 

Regulation of canonical Bmp-signalling also has a role during hair 
follicle induction and differentiation of follicular progenitor popula- 
tions into the mature hair follicle (reviewed in ref. 14). Phosphorylation 
of SMAD proteins 1, 5 and 8 (pSMADI, 5 and 8) is a robust readout of 
canonical Bmp signalling. We detected pSMAD1/5/8 (the antibody 
detects the phosphorylated forms of all of these proteins but does not 
exclusively differentiate any particular one) at low levels during follicle 
induction and later at higher levels in dermal papilla and matrix cells 
undergoing differentiation in the hair bulb (Fig. 3i, j). Furthermore, 
we detected SOX2-positive dermal papilla in some regenerating hair 
follicles, which is consistent with its role in specifying various hair types 
during mouse hair follicle development’’ (Fig. 3k). Taken together, 
these results demonstrate that regenerating hair follicles in Acomys 
progress through defined stages of hair follicle development, exhibit 
high rates of proliferation, and reuse molecular pathways used during 
embryonic hair follicle development to regenerate new hair follicles. 

Adult mammal skin is normally unable to regenerate epidermally- 
derived structures in response to wounding (for example, glands and 
hair follicles). An exception to this is the observation of spontaneous 
folliculargenesis in large excisional wounds in rabbits, and more 
recently in laboratory mice (C57BL6/SJ, SJL or mixed strain)'*"*. 
Rabbits are also one of the few mammalian species that can regenerate 
large ear punch wounds”. We proposed that the regenerative capacity 
observed in Acomys might extend to their ear tissue as well. To test this 
we made 4-mm punches through the ears of both Acomys species and, 
to our surprise, found that they were able to close these large punches 
(Fig. 4a—c and Supplementary Fig. 7a—c). Uninjured ear tissue contains 
skin (epidermis and dermis), associated hair follicles, adipose cells, 
muscle and cartilage; we found that Acomys were capable of completely 
regenerating all of these tissues with high fidelity except muscle 
(Fig. 4b, c). Twelve days after injury we observed an accumulation of 
cells around the circumference of the wound beneath the 
epidermis, and, although regeneration of new tissue was centripetal, 
cells accumulated to a greater degree on the proximal side of the punch. 
Hair follicle and cartilage regeneration proceeded in a proximal to 
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Figure 3 | Acomys exhibit de novo hair follicle regeneration in wounded 
skin. a-d, Hair follicles regenerating in A. percivali (yellow arrowheads) 
between D21 and D28 in large skin wounds. New hair follicles (yellow 
arrowheads) are present throughout the wound bed (red dotted area) at D28 
(c, d). Green arrowheads indicate old follicles. e-k, Regenerating hair follicles 
express proteins associated with development and differentiation. Ki67 labels 
proliferating hair germ (e). KRT17 (yellow arrowheads) is present in Acomys 
but absent in Mus at D26 (f). Nuclear-localized LEF1 is present in follicle 
placodes (g) and later in dermal papilla cells (DP) and surrounding matrix cells 
(MX) (h). Phosphorylated SMAD1, 5 and 8 (as a readout of Bmp-signalling) is 
present in epidermal hair germ cells (i) and later in dermal papilla and matrix 
cells of regenerating follicles (j). SOX2 is present in dermal papilla cells 

(k). Scale bars, 100 ttm (a-d, f-k) and 50 um (e). 


distal wave (Fig. 4d, e) and similar to the skin, follicular epidermis in 
the ear activated Wnt-signalling (Supplementary Fig. 6d, e). In contrast 
to Acomys, we found Mus were incapable of regenerating 4-mm ear 
punches and instead formed scar tissue (Supplementary Fig. 8a, b). 
Notably, despite scar formation, Mus ear repair resulted in the de novo 
formation of cartilage condensations distal to the cut cartilage suggest- 
ing Mus might initiate, but not maintain, a regenerative response after 
ear wounding (Supplementary Fig. 8b). 
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It remains unclear whether mammalian regeneration proceeds 
through formation of a blastema, or is instead an exaggerated version 
of hyperplastic growth” ”’. Blastema formation is considered a hallmark 
of epimorphic regeneration. One characteristic of a regeneration 
blastema is that it contains proliferating cells and maintains prolifera- 
tion during regeneration”. We observed widespread proliferation 
throughout the ear regenerate in Acomys and surprisingly, throughout 
healing ear tissue in Mus (Fig. 4f, g). However, we noted a lack of 
proliferation in the distal epidermis of Acomys, whereas we detected 
proliferation throughout Mus epidermis extending to the distal tip 
(Fig. 4f, g). Although proliferation was maintained in Acomys ears, 
we observed almost no proliferating cells in later-staged Mus ears 
(Fig. 4h, i). 

A second characteristic of a blastema is the formation of a specialized 
epidermal signalling centre (the wound epidermis), which is required 
for proliferating blastemal cells to remain in the cell cycle** and is 
characterized by a loss of epidermal stratification, loss of basal 
keratinocyte polarity, and lack of a mature basal lamina”. After 
re-epithelialization in Acomys, we noted a thickening of the distal 
epidermis, disorganization of basal keratinocytes and absence of a 
mature basement membrane (Fig. 4j). Comparatively, the epidermis 
near the amputation plane exhibited normal stratification and 
possessed a prominent basement membrane (Fig. 4k). By contrast, 


564 | NATURE | VOL 489 | 27 SEPTEMBER 2012 


Mus seemed to form a wound epidermis only transiently after 
re-epithelialization, with a proportionately smaller distal area 
exhibiting these characteristics for a short time (data not shown). By 
D12 in Mus, collagen typeIV staining revealed a mature basement 
membrane beneath the entire epidermis of the healing ear (Fig. 41). 
In addition, the epidermis exhibited normal stratification and proper 
apical-basal polarity of the basal keratinocytes (Fig. 4g, 1). 

In addition to sustained proliferation and formation of the wound 
epidermis, ECM molecules have a key role in supporting proliferation 
and directing subsequent differentiation during regeneration”®. By 
contrast, molecules such as laminin and collagen type I, which 
favour differentiation, are downregulated in the blastema during 
amphibian limb regeneration and are expressed as differentiation 
of the musculoskeletal system proceeds”®”’. Histological examination 
of Acomys ears at D12 revealed high levels of fibronectin, some 
tenascin-C (TN-C) surrounding densely packed cells, but very low 
levels of collagen type I (Supplementary Fig. 9a—c). Collagen type III 
was also more abundant than collagen type I during regeneration 
(Supplementary Fig. 9d—-d’). TN-C became restricted from areas where 
new cartilage began differentiating and within these differentiating 
cells we found activation of the Bmp-signalling pathway in cells giving 
rise to new auricular cartilage (Fig. 40 and Supplementary Fig. 10). 
During hyperplastic growth in Mus ears, the ECM initially displayed 


Figure 4 | Acomys regenerate hair follicles, 
sebaceous glands, dermis, adipose tissue and 
cartilage in 4-mm ear punches. a, A regenerated 
4-mm ear punch in A. percivali. b, Unwounded 
tissue in Acomys ear pinna. c, Regenerated dermis, 
hair follicles, cartilage and adipose tissue within 
biopsy punched area. White circle denotes the 
original punch area. d, Regenerating hair follicles 
(yellow arrowheads) and cartilage (green 
arrowheads) differentiate proximal to distal. 

e, Safranin-O and fast green staining indicates 
chondrogenesis (green arrowheads). 

f-i, Proliferating cells (Ki67*) in early (f, g) and 
late (h, i) Acomys and Mus ears. Proliferation is 
restricted proximal to the wound epidermis (WE) 
(red arrowheads) in Acomys (f) and is continuous 
in basal keratinocytes of Mus (g). Proliferation is 
maintained in Acomys at D32 (h) with very few 
proliferating cells persisting in Mus (i) (red 
arrowheads). j-l, Collagen-IV-stained mature 
basement membrane is absent beneath the wound 
epidermis (E) in Acomys (j), but is present near the 
amputation (k) and distally in Mus (1). Yellow 
arrowheads indicate basement membrane, and 
white lines indicate epidermal thickness. 

m, n, Almost no «SMA-positive fibroblasts are 
present in Acomys (m), whereas «SMA-positive 
myofibroblasts are present in healing Mus ear 
(n). Inset shows stress fibres in individual 
myofibroblasts. 0, TN-C disappears where new 
cartilage differentiates (white arrows) in Acomys. 
Yellow/green cells (j-o) are autofluorescing blood 
cells in the green fluorescent protein (GFP) 
channel. Scale bars, 100 ttm. 
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high levels of fibronectin and low levels of TN-C as did Acomys ears, 
but produced relatively higher levels of collagen type I (Supplementary 
Fig. 9e-g). Collagen production in Mus was not only faster and more 
abundant, but also exhibited a higher ratio of collagen type I to type HII 
(Supplementary Fig. 9h, h’). Given the exuberant production of 
collagen type I in Mus, we asked whether resident fibroblasts were 
differentiating into myofibroblasts, which contribute to scarring in lieu 
of regeneration (reviewed in ref. 28). Using «-smooth muscle actin 
(aSMA), we found myofibroblasts in high abundance throughout ear 
tissue in Mus, whereas they were almost completely absent in Acomys 
ears (Fig. 4m, n). These data corroborate the importance of the wound 
ECM to promote proliferation while antagonizing differentiation and 
support previous work showing precocious collagen typeI formation 
antagonizes appendage regeneration”. 

Our data suggest that reparative ear regeneration in Acomys is a 
balance between premature reformation of the dermis (scarring) and 
maintenance of cell proliferation within a pro-regenerative environ- 
ment. By contrast, Mus fails to form (or maintain) a wound epidermis, 
which is coincident with precocious formation of the basement 
membrane and stratification of the epidermis. This leads to a loss of 
cell proliferation, increased collagen typeI deposition (in lieu of 
collagen type III), myofibroblast activation and ultimately, scar forma- 
tion. Although our data suggest ear regeneration shares similar 
characteristics with blastema formation, understanding the molecular 
signals required to organize and maintain a wound epidermis and 
identifying the lineage of regenerating cells is crucial to address how 
regeneration occurs in these animals. Future work investigating how 
Acomys can control fibrosis will shed light on how regeneration and 
scarring can be balanced in the face of infection and inflammation in 
wild mammals, and provides an ideal model system in which to 
examine epimorphic regeneration in mammals. 


METHODS SUMMARY 

Animals. Specimens of A. kempi and A. percivali were live-captured in Laikipia, 
Kenya, at the Mpala Research Centre. Experimental animals were held in an open- 
air field laboratory under ambient conditions. For comparative experiments, Swiss 
Webster mice were obtained from a local supplier in Nairobi and in the United 
States from Charles River. 

Wounding. All experiments conducted in Kenya were performed in accordance 
with approved animal practices. Experiments performed in the United States were 
approved by the Institutional Review Board on Animal Care at the University of 
Florida and in Kenya by the University of Nairobi. Spiny mice were halothane- 
anaesthetized and small (4mm) and large (1.5cm) full-thickness excisional 
wounds were made on the dorsum. 

Strength measurements. Strength measures were made using a Hounsfield 
tensometer equipped with an automatic motor and stretched at a rate of 
20mm min“ '. Full-thickness skin strips (including the panniculus carnosus) 
measuring approximately 20 x 40mm were used. Force (N) and displacement 
(mm) were measured on a xy plotter and these points were subsequently recorded 
as stress (o = force per cross-sectional area) and strain (¢ = change in length/ 
initial length) and replotted in Excel. The work (W) required to achieve the break- 
ing strength was calculated as the area under each stress-strain curve and 
expressed as the mean work for each genus. 

Histology and immunohistochemistry. For histological analysis, samples were 
fixed in 10% neutral buffered formalin at 4°C for 16-24h, washed in PBS, 
dehydrated in ethanol and infiltrated with paraffin. Samples were cut at 5 um. 
For immunohistochemistry, slides were incubated with primary antibodies (see 
Methods) and visualized using either 3,3’-diaminobenzidine (DAB) or Alexa- 
Flour 594. Negative controls were run using appropriate Ig isotypes at the same 
concentration as the primary antibody. For all immunohistochemistry com- 
parisons, a minimum of n = 4 per species was used. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Animals. Male and female A. kempi and A. percivali were live-captured using 
Sherman traps in Laikipia, Kenya, at the Mpala Research Centre, between 2009 
and 2011. Fifty traps were set in the late afternoon at various kopjes (rock 
outcroppings), and traps were checked the next morning for animals. Captured 
animals were transported to an open-air field laboratory and held under ambient 
conditions in species-specific groups with access to water ad libitum. Animals were 
fed twice daily on peanut butter and oats. Some animals were transported to the 
University of Nairobi, where they were maintained under similar conditions. For 
comparative experiments, Swiss Webster mice were obtained from a local supplier 
in Nairobi and in the United States from Charles River. 

Wounding. All experiments conducted in Kenya were performed in accordance 
with approved animal practices. Experiments performed in the United States were 
approved by the Institutional Review Board on Animal Care at the University of 
Florida and in Kenya by the University of Nairobi. Both male and female mice 
were used, but pregnant females were excluded. Three types of wounds were used 
for this study: natural wounds, small circular wounds (4 mm diameter), and large 
circular wounds (1.5 cm diameter). Spiny mice were halothane-anaesthetized and 
shaved at least 1 day before wounding. Natural wounds were not made by the 
investigators and occurred either in the field or in captivity. Full-thickness exci- 
sional wounds were made by pinching the skin with forceps and cutting beneath to 
form a 4-mm circle (small) or cutting with iridectomy scissors a 1.5-cm diameter 
circle (large). For small wounds, four wounds were made on the dorsum posterior 
to the forelimbs and anterior to the hindlimbs. Wounds were allowed to heal and 
were not treated during healing. Calipers were used to measure wound width 
(mm) and wounds were recorded using a Canon S900 (1-4 Macro zoom). 
Strength measurements. After culling, the entire dorsal skin including the 
panniculus carnosus was removed and placed face down in PBS. Two strips of 
dorsal skin measuring 20 X 40 mm were excised from this preparation maintain- 
ing the anteroposterior orientation, and the second strip was returned to PBS while 
the first strip was measured. Strips for mechanical testing were placed in metal 
screw clamps with rubber pieces covering the clamped ends. Clamps were placed 
in a Hounsfield tensometer equipped with an automatic motor and stretched at a 
rate of 20mmmin !. Tissue measurements for width (mm) and length (mm) 
between clamps were measured before stretching, and PBS was applied before 
loading to keep the samples moist. Force (N) and displacement (mm) were 
measured on a xy plotter, and these points were subsequently recorded as stress 
(o = force per cross-sectional area) and strain (¢ = change in length/initial length) 
and replotted in Excel. 

Load was applied to skin strips parallel to the long body axis and the tensile 

strength (o,,) and failure strain (¢¢) were recorded (M. musculus (n = 6), A. kempi 
(n = 5) and A. percivali (n = 5)). The work (W) required to achieve the breaking 
strength was calculated as the area under each stress-strain curve and expressed as 
the mean work for each genus. 
Histology and immunochemistry. For histological analysis, samples were fixed 
in 10% neutral buffered formalin at 4 °C for 16-24 h, washed in PBS, dehydrated in 
ethanol and infiltrated with paraffin. Samples were cut at 5 um. For immunohisto- 
chemistry, slides were incubated with primary antibodies (see Methods) and 
visualized using either 3,3’-diaminobenzidine (DAB) or Alexa-Flour 594. 
Negative controls were run using appropriate Ig isotypes at the same concentra- 
tion as the primary antibody. For all immunohistochemistry comparisons, a min- 
imum of n = 4 per species was used. 


Primary antibodies and antigen retrieval used were: tenascin-C (1:50; Abcam; 

ab3970) with pronase at 1 ,1gml' for 10 min at room temperature; fibronectin 
(1:500; Abcam; ab23750) with heat-induced epitope retrieval (HIER) in citrate 
buffer (pH6.0); pan-cytokeratin (1:1,000; DAKO; Z0622) with proteinase K 
(DAKO) for 2min at room temperature; cytokeratin 17 (1:400; Abcam; 
ab53707) with proteinase K for 2 min; Ki67 (1:2,000; Abcam; ab15580) with 
HIER in Tris-EDTA (pH 9.0); LEF1 (1:4,000; gift from O. Huber) with HIER in 
citrate buffer (pH 6.0) followed by treatment with 0.1% trypsin for 10 min room 
temperature; f-catenin (1:1,000; Sigma; HPA 029159) with HIER in citrate buffer 
(pH 6.0); pSMAD1/5/8 (1:200; Cell Signaling; 9511) with HIER in Tris-EDTA 
(pH 9.0); SOX2 (1:150; Abcam; ab97959) with HIER in citrate buffer (pH 6.0); 
collagen type IV (1:500; Rockland; 600-401-106-0.1) with proteinase K for 2 min; 
aSMA (1:200; Abcam; ab32575) with HIER in citrate buffer (pH 6.0); collagen type I 
(1:500; Abcam; ab34710) with HIER in citrate buffer (pH 6.0) followed by proteinase 
K for 2 min. For detection of rabbit antibodies, Vector Elite anti-rabbit kits were 
used, and for mouse monoclonal antibodies (TN-C) Vector M.O.M. kits were used. 
Antibodies were visualized with either diaminobenzidine (DAB) (Vector) or a 
streptavidin-conjugated Alexa-Flour 594 antibody after incubation with biotiny- 
lated anti-secondary. 
Estimation of the proportion of the adnexa in the dermis. The volume density 
of the adnexa in the dermis Vy (adnexa, dermis) is the ratio of the total volume of follicles 
and sebaceous glands to the total volume of the dermis. This ratio can be estimated 
by point counting on plane sections””*'. For this purpose, one histological section 
was randomly sampled from each animal (n = 6 per species). The corresponding 
micrographs were projected ona screen, anda transparent test grid bearing a square 
lattice of points overlaid with random positions on each projected image: 


Vv(adnexa, dermis) = P, /Pa 


The total number of test points falling on profiles of the adnexa P, and on the entire 
dermis P,, was counted. An estimator of the volume density of the adnexa was then 
calculated that is unbiased (that is, there is no systematic error due to the sampling 
or counting procedures) provided that the plane section and the test grid are 
randomly positioned. 

Statistical analysis. To test for significant differences between percentage adnexa, 
the Student’s two-tailed t-test function in Excel was used to calculate P values. 
Alpha was set at 0.05 and data are mean + standard error. 

Descriptive statistics. For strength measure, n = 6 for M. musculus and n = 5 for 
A. kempiand A. percivali. The following data are mean + s.e.m. Mean failure strain 
(e_): Mus (0.61 + 0.05), A. kempi (0.14 + 0.05) and A. percivali (0.08 + 0.02). Mean 
tensile strength (MPa): Mus (2.30.19), A. kempi (0.15 + 0.04), A. percivali 
(0.08 + 0.03) and Acomys (both species) (0.11 + 0.03). Mean toughness (MJ m-): 
Mus (0.77 + 0.08), A. kempi (0.012 + 0.006), A. percivali (0.005 + 0.003) and 
Acomys (both species) (0.01 + 0.004). The percentage of adnexa, n = 6 for each 
species. A. percivali (55.61 + 4.28%) compared with M. musculus (43.65 + 4.62%) 
(t = 1.9, P= 0.043). The percentage of wound contraction, n = 12 for A. percivali 
(95.5 + 0.7%). 
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Influenza viruses take a yearly toll on human life despite efforts to 
contain them with seasonal vaccines. These viruses evade human 
immunity through the evolution of variants that resist neutraliza- 
tion. The identification of antibodies that recognize invariant 
structures on the influenza haemagglutinin (HA) protein have 
invigorated efforts to develop universal influenza vaccines. 
Specifically, antibodies to the highly conserved stem region of 
HA neutralize diverse viral subtypes. These antibodies largely 
derive from a specific antibody gene, heavy-chain variable region 
IGHV1-69, after limited affinity maturation from their germline 
ancestors’”, but how HA stimulates naive B cells to mature and 
induce protective immunity is unknown. To address this question, 
we analysed the structural and genetic basis for their engagement 
and maturation into broadly neutralizing antibodies. Here we 
show that the germline-encoded precursors of these antibodies 
act as functional B-cell antigen receptors (BCRs) that initiate sub- 
sequent affinity maturation. Neither the germline precursor of a 
prototypic antibody, CR6261 (ref. 3), nor those of two other nat- 
ural human IGHV1-69 antibodies, bound HA as soluble immuno- 
globulin-G (IgG). However, all three IGHV1-69 precursors engaged 
HA when the antibody was expressed as cell surface IgM. HA trig- 
gered BCR-associated tyrosine kinase signalling by germline trans- 
membrane IgM. Recognition and virus neutralization was 
dependent solely on the heavy chain, and affinity maturation of 
CR6261 required only seven amino acids in the complementarity- 
determining region (CDR) H1 and framework region 3 (FR3) to 
restore full activity. These findings provide insight into the initial 
events that lead to the generation of broadly neutralizing antibodies 
to influenza, informing the rational design of vaccines to elicit such 
antibodies and providing a model relevant to other infectious dis- 
eases, including human immunodeficiency virus/AIDS. The data 
further suggest that selected immunoglobulin genes recognize spe- 
cific protein structural ‘patterns’ that provide a substrate for further 
affinity maturation. 

Antibodies to the conserved stem region of HA block membrane 
fusion and prevent productive infection by diverse influenza viruses. 
The structural basis of HA stem recognition of two such monoclonal 
antibodies, CR6261 and F10, has been defined**. These antibodies 
bind with nanomolar affinity to a highly conserved hydrophobic 
groove at the interface of HA1 and HA2, the two polypeptides that 
constitute HA, and neutralize several influenza group 1 subtypes 
including H1, H5, H6, H8 and H9 (ref. 3). Among the anti-stem 
antibodies isolated so far, the IGHV1-69 gene segment is observed 
more frequently than expected by chance’. To understand the 
development of these antibodies, we studied the prototypic IGHV1- 
69-derived broadly neutralizing antibody CR6261, isolated by phage 
display of human immunoglobulin genes, as well as two others cloned 
from single human cells, FE53 and 1009-3B05 (refs 1-3). Influenza 
IGHV1-69-based broadly neutralizing antibodies undergo a relatively 
low degree of somatic mutation (an average of 14 amino acids in the 


heavy chain, n = 9)'* (Fig. 1a). We first asked whether their germline 
antibody precursors might recognize HA with measurable affinity. 
Notably, the IGHV1-69 germline ancestors of CR6261, FE53 and 
1009-3B05 failed to bind HA as soluble IgG, even at concentrations 
as high as 100 ug ml! (Fig. 1b). 

To define the molecular basis for affinity maturation of these 
antibodies, we analysed the respective contributions of heavy and light 
chains to antigen recognition. We compared chimaeric antibodies that 
consisted of somatic heavy (sH) and germline light (gL) chains to the 
mature antibody (sHsL). The chimaeric sHgL of all three antibodies 
bound to a recombinant H1 HA with affinities similar to their respect- 
ive matured sHsL (Fig. 1b). Maturation of the light chain thus does not 
affect binding to H1 HA. Rather, somatic mutation of the IGHV1-69 
heavy chain gene alone mediates the increase in binding affinity. This 
finding is consistent with the lack of light-chain interaction observed 
in the crystal structures of both CR6261-HA and F10-HA com- 
plexes’, suggesting that IGHV1-69 light-chain somatic mutation 
may be incidental and is not required for heavy-light-chain pairing 
or improved neutralization function. 

We next investigated the minimum requirements of heavy-chain 
loop maturation that lead to somatic activity. We chose to focus this 
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Figure 1 | Somatic maturation of heavy chain confers HA reactivity. 
a, Amino acid alignment of somatic CR6261 heavy-chain variable region 
(including the D and J regions) with JOINSOLVER” predicted germline 
precursor (Kabat*® convention). Somatic mutations are shown in red. Those 
incorporated into germline variants are underlined. Mutations T28P and S30R 
are in close proximity to the Kabat-defined CDR H1 and are CR6261 contact 
residues, and therefore defined as CDR H1 somatic mutations in the remaining 
text. b, ELISA binding of somatic (sHsL, filled circles), respective germline 
(gHgL, open circles), and chimaeric (sHgL, half-filled circles) indicated HA- 
stem-specific antibodies to H1 1999 NC. OD, optical density. Error bars 
represent standard deviations of the mean for each antibody concentration. 
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analysis on CR6261, the first |GHV1-69 anti-stem broadly neutralizing 
antibody discovered, for which the structure is known. The sH of 
CR6261 differs from the germline heavy (gH) chain by only 14 amino 
acids (11.6%) (Fig. 1a). Four of these amino acids (Pro 28, Arg 30, 
Lys58 and Phe74) contact the HA stem in the crystal structure 
(Fig. 2a). We generated several CR6261 germline variants by intro- 
ducing residues from sH, including CDR H1, CDR H2, CDR H3 or 
FR3, into gH, singly or in combination. The resulting antibodies were 
analysed for their ability to bind HA and neutralize virus in an HA- 
pseudotyped lentiviral system® (Table 1). Tested individually, only the 
somatic CDR H1 (sCDR H1: Thr28Pro/Ser30Arg) increased binding, 
but it was more than 100-fold less than sH. Furthermore, sCDR H1 
alone did not confer detectable activity in our neutralization assay 
(Table 1). Although sFR3 alone did not improve the potency of germ- 
line CR6261, sCDR H1 and sFR3 together restored full activity for both 
binding and neutralization against HIN1 and H5N1 viruses (Table 1, 
Fig. 2b and Supplementary Fig. 1). This finding suggests that the 
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Figure 2 | CDR H1 maturation confers CR6261 neutralization activity. 

a, CR6261 interaction with H1 A/South Carolina/1/1918 (1918 SC) HA stem 
(grey, PDB accession 3GBN). Side chains with at least 10 A’ of interaction are 
depicted as sticks, with somatically mutated residues coloured red. b, ELISA 
binding and neutralization of H1 1999 NC by mature (filled circles) and sCDR 
H1/sFR3 germline-mutated CR6261 (filled squares) Error bars represent 
standard deviations of the mean for each antibody concentration. c, CDR H1 of 
CR6261 uncomplexed (blue) or with HA (light blue, PDB accession 3GBN) 
compared with unmutated CDR H1 from 47e (cyan, PDB accession 1RZI); 
mutated residues are in red (CR6261) or pink (47e). A non-crystallographic 
symmetry (NCS)-averaged electron density map (grey) is contoured at 1.0c. 
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mutation of only a small number of germline residues enables potent 
neutralization. 

Because CDR HI] and FR3 sit adjacent to one another in the folded 
protein, we examined the protein structure further. In most IGHV 1-69 
antibodies with no CDR H1 mutations, Phe 29 is buried in the ‘canonical’ 
conformation of the CDR H1 (ref. 7). In contrast, the somatically 
mutated CDR H1 of CR6261 flips the hydrophobic residue Phe 29 
out, placing this residue in contact with HA. To determine whether 
the position of Phe 29 in CR6261 is due to maturation or binding, we 
solved the crystal structure of unliganded CR6261(sHgL), and com- 
pared it to both a IGHV1-69 antibody with no mutations in CDR H1, 
and HA-liganded CR6261 (Fig. 2c). In the unliganded structure, 
Phe 29 is exposed on the surface, suggesting that the somatic mutations 
Thr28Pro and Ser30Arg lead to its placement there—a hydrophobic 
residue from the interior of the antibody is made available to contribute 
to the interface by mutation of adjacent residues. The CDR H1 of 
IGHV1-69 in both germline and affinity-matured states is not well 
ordered unless bound to antigen. However, the CDR H1 loop in 
CR6261 favours the non-canonical conformation, with Phe 29 
exposed, whether or not HA is bound. The solved structure suggests 
that the main consequence of somatic CDR H1 mutation is to favour 
this non-canonical state. Comparison of these structures also explains 
the synergistic effect of mutations in CDR H1 and FR3; somatic muta- 
tion in FR3 introduces Phe 74 on the surface of CR6261 adjacent to 
Phe 29 in CDRH1, and these two sets of mutations increase the hydro- 
phobicity of the contact surface more than either alone. 

The minimal somatic mutation of the CR6261 germ line required to 
confer full activity made it surprising that the germline antibody did 
not recognize HA with any measurable affinity in solution. We pro- 
posed that antibody recognition by the low-affinity germline IGHV1- 
69 revertant requires a more physiological presentation, for example, 
on the cell surface where such antibodies would normally be expressed. 
A naturally bivalent transmembrane IgM form of the CR6261 germ 
line was transfected into human embryonic kidney 293F cells. Cell 
surface expression was confirmed by staining with an anti-lambda- 
chain antibody. Fluorescent-labelled HA stained CR6261-germline- 
expressing cells but not mock transfected cells, as measured by flow 
cytometry (Fig. 3a). In contrast, no binding was seen in solution with 
CR6261 germline Fab monomer, bivalent IgG, or decameric IgM 
antibody derivatives (Supplementary Fig. 2). The specificity of 
CR6261 germline binding to the HA stem was further confirmed by 
its minimum reactivity to a mutant HA probe that blocks anti-stem- 
antibody binding (Supplementary Fig. 3). FE53 and 1009-3B05 each 
showed similar membrane-dependent recognition (Fig. 3a). Our 
results suggest that membrane presentation of antibody provides a 
mechanism by which low-affinity germline B cells achieve sufficient 
binding to recognize antigens before affinity maturation, and are con- 
sistent with experiments showing that two-dimensional confinement 
and clustering at the plasma membrane can increase the apparent 
affinity of cell surface receptors*”. 

Two amino acids at the tip of CDR H2, Ile 53 and Phe 54, seem to be 
an anchor by which germline IGHV 1-69 might attach to HA (Fig. 2a). 
Indeed, mutation of these two amino acids to alanines abolished HA 
germline binding for all VH-1-69 antibodies (Fig. 3a). Individual muta- 
tion of Ile 53 and Phe 54 also abolished binding of unmodified/native 
HA trimer (binding in the presence of 6’-sialyllactose to prevent sialic 
acid mediated cell adhesion; Fig. 3b and Supplementary Fig. 4). In the 
CR6261-HA co-crystal structure, CDR H2 inserts into a hydrophobic 
pocket between HAI and HA2 (ref. 3) (Fig. 3c). These data suggest that 
this interaction of the germline CDR H2 has a central role in recog- 
nition and engagement of the HA stem. 

To determine whether binding of antigen to germline antibodies 
displayed on the cell surface could induce BCR activation and signal- 
ling, the transmembrane IgM version of the CR6261 germline was 
transfected into a transformed human B cell capable of expressing a 
functional BCR’®"’, a Ramos cell clone whose endogenous IgM is not 
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Table 1 | HA binding and pseudovirus neutralization analysis 


H1 1999 NC 
Germline revertants Mutations Titre (ug ml~1) IC50 (wg ml~*) 
Somatic Full 0.00133 1.9 
Germline None >100 >25 
sCDR H1 T28P S30R 0.150 >25 
sCDR2 H2 A57T N58K Q61P >100 >25 
sFR3 D73E F74S A75T G76S V78A 63.2 >25 
sCDR H3 V100L >100 >25 
sCDR H1/sCDR H2 T28P S30R A57T N58K Q61P 0.150 >25 
sCDR H1/sFR3 T28P S30R D73E F74S A75T G76S V78A 0.00133 2.0 
sCDR H1/sCDR H3 T28P S30R V1OOL 0.250 >25 
sCDR H2/sFR3 A57T N58K Q61P D73E F74S A75T G76S V78A 0.950 >25 
sCDR H2/sCDR H3 A57T N58K Q61P V1OOL 0.150 >25 
sFR3/sCDR H3 D73E F74S A75T G76S V78A V100L >100 >25 
sCDR H1/sCDR H2/sFR3 T28P S30R A57T N58K Q61P D73E F74S A75T G76S V78A 0.00133 14 


Binding was determined by ELISA endpoint dilution titres. Half-maximum inhibitory concentration (ICso) values represent 50% neutralization of pseudotyped lentiviruses by the respective antibodies. Mutations 
incorporated into germline-revertant antibodies correspond to the variable regions indicated. 


expressed (Methods). We found that proteoliposome-arrayed HA 
(Supplementary Fig. 5) selectively triggered tyrosine phosphorylation 
of BCR effector proteins HS1 and SLP-65 (ref. 12) (Fig. 3d). Signalling by 
HA was comparable to that induced by IgM cross-linking. Furthermore, 
mutation of Ile53Ala/Phe54Ala in CDR H2 abolished the response to 
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HA stimulation, consistent with the binding data and confirming the 
importance of the germline CDR H2 structure in naive B-cell activation. 
These findings indicate that engagement of low-affinity germline 
IGHV1-69 antibody can lead to BCR activation, thus triggering further 
maturation and the subsequent humoral immune response. 
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Figure 3 | HA engages and activates membrane-presented germline 


antibody. a, 293F cells expressing membrane IgMs (VRCO1 (grey), IGHV1-69 


germ lines (wild-type and Ile53Ala/Phe54Ala mutants, coloured lines)) 
exposed to HA with an N-linked glycan introduced to block the sialic-acid- 
binding site (ARBS) (red) and anti-light-chain expression controls (purple) 


analysed by flow cytometry. b, Unmodified HA trimer also engages germline 
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(30 mM 6'-sialyllactose present) by flow cytometry. c, CR6261 CDR H2 
interactions with 1918 SC HA stem (PDB accession 3GBN). d, HA 
proteoliposomes selectively activate tyrosine phosphorylation (pY) by CR6261 
germline BCR (P < 0.0002, ANOVA); wild-type and mutant CDR H2 receptor 
activity differed for HA (asterisk) but not for anti-IgM (Tukey’s HSD, 

a = 0.05)). Presented are the mean values and standard errors of pY intensity. 


We have shown in this study that IGHV1-69 antibodies, with no 
mutation of the germline-encoded VH sequence, engage influenza HA 
with sufficient affinity to trigger B-cell activation. In all cases, engage- 
ment depends on membrane presentation of antibodies with the same 
structural determinant—two specific hydrophobic residues in the 
CDR H2. Mutational analysis further revealed that just a few muta- 
tions convert a germline IGHV1-69 into an antibody with the full 
activity of CR6261. Taken together, these results suggest that the 
IGHV1-69 germ line is poised to form, with a varied set of CDR H3 
sequences, broadly neutralized antibodies directed against the highly 
conserved stem of influenza HA. 

Having an antibody with the inherent potential to recognize a 
common feature of influenza virus would seem to offer obvious evolu- 
tionary advantages. Although neutralizing antibodies against other 
viruses such as human immunodeficiency virus (HIV), SARS and 
hepatitis C virus also use the IGHV1-69 gene for recognition’*"’, these 
interactions differ. For example, 17b antibody binding to the CD4- 
induced site on HIV-1 Env“ and 8066 antibody binding to HIV-1 
gp41 (ref. 19) orient the CDR H2 into a hydrophobic cleft whereas 
others such as the SARS neutralizing antibody M396 orient CDR H2 
into a more hydrophilic site’®. In each case, the actual fold recognized 
by the CDR H2 is distinct from that recognized on the HA stem. 
Together, the data suggest that the IGHV1-69 CDR H2 motif is par- 
ticularly well adapted to recognize the specific protein fold that is 
highly conserved on the stem of diverse influenza virus HAs. 

This germline VH gene, although expressed as part of the adaptive 
immune response, may therefore serve as a primordial pattern recog- 
nition receptor, structurally adapted to participate in recognition of 
such hydrophobic grooves’. As no other heavy-chain V genes have a 
hydrophobic CDR H2 (ref. 14), we speculate that influenza may have 
exerted selection pressure leading to retention of this Vx gene. It is also 
conceivable that the adaptive immune system retains this innate 
capacity for hydrophobic contact to facilitate the generation of high- 
affinity antibodies after only a limited degree of somatic mutation. This 
situation contrasts with the considerable somatic mutation (31%) 
observed in broadly neutralizing antibodies directed to the CD4- 
binding site of HIV’, a more recently evolved virus for which there 
has been less opportunity to select for protective antiviral genes. 

The majority of current vaccine-elicited influenza antibodies are 
directed to the globular head region of viral surface glycoprotein 
HA, which undergoes considerable antigenic drift to evade the human 
immune system. Thus, influenza vaccination requires annual assess- 
ment of the strains likely to circulate in the coming year to generate 
protective immunity for the world population. Consequently, there is 
great interest in the development of a universal influenza vaccine*’”’. 
Antibodies targeting the highly conserved HA stem epitope represent 
the vast majority of all broadly neutralized antibodies isolated against 
influenza so far’*. We have previously demonstrated that it is possible 
to elicit CR6261-like stem-directed antibodies by vaccination”. 
Despite sequence variations in the CDRs and FR regions, IGHV 1-69 
antibodies evolve with a relatively small number of somatic mutations. 
At the same time, robust induction of these neutralizing antibodies in 
humans remains a challenge. Analysis of current genomics data (1000 
Genome Project) reveals a polymorphism in the IGHV1-69 allele at the 
contact site in CDR H2 where Leu replaces Phe 54 at a predicted 
homozygous frequency of 13% in the general population. This substi- 
tution abolishes HA recognition by all three germline antibodies 
(Supplementary Fig. 6). This finding underscores the remarkable spe- 
cificity in the CDR H2 contact and raises the possibility that some 
vaccine recipients may not mount a IGHV1-69 response. At the same 
time, ~30% of stem antibodies isolated until now derive from other VH 
gene segments’, and it remains possible that such individuals might 
develop responses from other IgG loci. Taken together, knowledge of 
the initial engagement of the HA stem with the IGHV1-69 V gene, its 
subsequent BCR stimulation and somatic mutation will aid in the 
rational design of universal influenza vaccines. 
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The genes encoding wild-type HA proteins (H1 A/New Caledonia/20/1999 (1999 
NC) and H5 A/Indonesia/05/2005 (2005 IND)) and somatically mutated and 
inferred germline antibodies of CR6261, FE53, and 1009-3B05 were synthesized. 
Somatically mutated CR6261 germline revertants and germline CDR H2 Ala 
mutations of Ile and Phe were constructed by introducing mutations to germline 
heavy chains by site-directed mutagenesis. Plasmids encoding these proteins were 
transfected into the human embryonic kidney cell line 293F and isolated from 
expression supernatants 72-96h after transfection. HA trimeric proteins were 
purified as previously described**. Soluble Fab, IgG and IgM antibodies were 
purified using HisTrap, Protein G and IgM affinity columns, respectively, with 
additional gel-exclusion chromatography performed on IgM samples (GE 
Healthcare). 

The purified antibody variants (1.7 X 10 *-100 1g ml~') were assayed for 
binding to H1 1999 NC and in some cases to H5 2005 IND by enzyme-linked 
immunosorbent assay (ELISA) with purified trimeric HA proteins. The various 
antibodies were detected by peroxidase-conjugated goat anti-human IgG unless 
otherwise noted. Endpoint dilutions were determined from nonlinear fit dose- 
response curves using a detection limit of 4x background absorbance. CR6261 
variants (0.39-25 jig ml~') were also assayed for neutralization of pseudotyped 
recombinant lentiviruses expressing wild-type HA with the corresponding 
neuraminidase (NA) with a luciferase reporter gene as previously described”. 

The crystal structure of CR6261 sHgL Fab at 2.85 A resolution was determined 
by molecular replacement (Methods). 

For membrane presentation of antibody, IGHV1-69 germ lines were expressed 
in membrane IgM format in 293F cells, which were then subject to a FACS-based 
HA cell surface binding assay 72 h after transfection (Methods). For BCR trigger- 
ing, membrane IgM germ lines were transiently expressed in an IgM-negative 
Ramos B cell line and exposed to proteoliposome-arrayed H1 1999 NC 
(Methods). Activation was assessed by extent of tyrosine phosphorylation using 
the 4G10 pY antibody as described’’. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Plasmid construction. Genes encoding HA proteins A/New Caledonia/20/1999 
(H1 1999 NC; GenBank AY289929) and A/Indonesia/05/2005 (H5 2005 IND; 
GenBank ABW06108.1), the corresponding neuraminidase proteins, monoclonal 
antibodies CR6261, FE53, 1009-3B05 and the respective inferred germline 
antibodies’ were synthesized using human preferred codons as described’ by 
GeneArt (Regensburg, Germany). The germ lines were predicted using 
JOINSOLVER” and were created from the following segments: IGHV1-69*01, 
IGHD2-02*02, IGHJ6*03 (CR6261); IGHV1-69*01, IGHD2-08*02, IGH]4*02 
(FE53); and IGHV1-69*06, IGHD2-08*01, IGHJ3*02 (1009-3B05). The genes 
were cloned into a CMV/R vector for efficient expression in mammalian cells”. 
Soluble and membrane IgM and Fab genes were generated by overlapping PCR 
(IgG and IgM class switch region was identified through sequence alignment of 
GenBank available sequences for cloning of soluble and membrane-bound IgM 
antibodies). Mutations to germline antibodies were introduced using the 
QuikChange Site-Directed Mutagenesis kit (Agilent Technologies). The following 
mutations of germline CR6261 heavy chain were generated to probe maturation: 
CDR H1 (Thr28Pro and Ser30Arg), CDR H2 (Ala57Thr, Asn58Lys and 
Gln61Pro), CDR H2-61P that lacks the mutation at amino acid 61 (Ala57Thr 
and Asn58Lys), CDR H3 (Vall100Leu) and FR3 (Asp73Glu, Phe74Ser, Ala75Thr, 
Gly76Ser and Val78Ala). Germline variant antibodies constructed to probe 
engagement correspond to the following mutations: CDR H2 mutations 
Tle53Ala, Phe54Ala and Phe54Leu. 

Cell surface binding. To evaluate the role that cell surface presentation has in the 
recognition of low-affinity antigens at the initial contacting germline stage, soluble 
germline antibodies were expressed as receptor IgMs on the surface of 293F cells 
(293fectin Reagent, Invitrogen). For the HA ligand, an N-linked glycan (HA1 
Argl92Thr, H3 numbering) was introduced into the receptor binding site 
(ARBS) of wild-type or HA stem binding mutant (Ile45Arg/Thr49Arg mutations 
in HA2, Astem) H1 1999 NC to prevent sialic-acid-mediated cell binding. The 
resulting avi-tagged trimer was biotinylated, purified by size-exclusion chromato- 
graphy, and labelled with a streptavidin-linked PE fluorophore”. Two days after 
transfection, 2 X 10° cells expressing membrane IgM versions of CR6261 germ 
line (gl), FE53 gl, 1009-3B05 gl (all + CDR H2 mutations Ie53Ala and Phe54Ala), 
or VRCO1 (mock) were placed on ice, stained with violet fluorescent reactive dye 
(ViViD, catalogue no. L34955, Invitrogen) for 30 min in PBS, washed, and then 
incubated (1h, 4°C, in PBS containing 1% FBS) with either: PE-HA ARBS (4 1g 
ml '); PE-HA ARBS Astem (4 lug ml '); PE-anti-human kappa chain (catalogue 
no. 12-9970-42, eBioscience) (FE53 gl, 1009-3B05 gl); PE mouse anti-human 
lambda chain (catalogue no. 555797, BD Biosciences) (CR6261 gl). For unmodi- 
fied/native HA binding, cells were incubated with 41g ml‘ trimer along with 
30 mM 6'-sialyllactose to prevent sialic-acid-mediated cell binding (Supplemen- 
tary Fig. 4) (1h, 4°C, in PBS). Cells were washed twice and then fixed in PBS 
containing 0.5% PFA. ViViD-negative cell surface PE intensity was quantified by 
flow cytometry (ViViD excitation at 407 nm, bandpass filter 450/50 nm; PE excita- 
tion at 532 nm; BD LSR II, BD Biosciences). 

HA proteoliposome preparation. This platform was used to provide both the 
multivalent antigen display needed to oligomerize and activate the BCR” and the 
synaptic engagement architecture that underscores naive B-cell stimulation by 
antigen-presenting cells in the lymph node germinal centre”. One gram of 
1,2-dioleoyl-sn-glycero-3-phosphocholine:1,2-dioleoyl-sn-glycero-3-[(N-(5-amino- 
1-carboxypentyl)iminodiacetic acid)] succinyl (Avanti Polar Lipids) in a 1:1 molar 
ratio was evaporated under a stream of nitrogen for 1 h. The dry lipid film was then 
rehydrated in 1,000 pl of liposome buffer (50 mM HEPES, 150 mM NaCl, pH 7.25 
(HBS)) and shaken for 40 min, all the time being heated above the membrane 
melting temperature of the lipid mixture. The resulting homogeneous suspension 
was subjected to ten freeze-thaw cycles and then extruded 21 times through a 
100-nm pore polycarbonate membrane using the Avanti mini-extruder (Avanti 
Polar Lipids). HA proteoliposomes were produced by incubating the resultant 
liposomes with His-tagged ARBS H1 1999 NC trimer for 1 h at room temperature 
(HA trimer:lipid molar ratio of 1:900). The sample was then adjusted to 15% 
iodixanol (in 1.25 ml HBS) and overlaid with 1.75 ml, 0.5 ml and 0.5 ml of 10%, 
2.5% and 0% iodixanol in HBS, respectively. Samples were then centrifuged at 
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200,000g in a TH660 rotor (Sorvall) for 2h. The proteoliposome fraction, which 
concentrated at the 2.5-0% iodixanol interface (Supplementary Fig. 5), was col- 
lected and dialysed overnight (Slide-A-Lyzer Dialysis Cassette, 10000 MWCO, 
catalogue no. 66380, Thermo Scientific) to remove density gradient material. 
After dialysis, proteoliposomes were pelleted (200,000g, 2 h, TH660 rotor), resus- 
pended in HBS and measured by the Pierce BCA protein assay (catalogue no. 
23227, Thermo Scientific). 

B-cell activation. Ramos B lymphocytes (ATCC) were initially stained with fluor- 
escently conjugated anti-lambda chain and anti-IgM monoclonal antibodies 
(mouse PE-anti-human lambda chain, catalogue no. 555797, BD Biosciences; 
APC-anti-human IgM, catalogue no. 314510, BioLegend) to isolate a B-cell clone 
with no functional BCR at the cell surface. Such a clone was identified by flow 
cytometry (BD FACSAria, BD Biosciences), expanded and re-sorted for IgM and 
light-chain negativity for a further six generations before use. For activation 
studies, 2 X 10° IgM-negative cells were transfected (Cell Line Nucleofector Kit 
V, Lonza Group) with wild-type or Ile53Ala and Phe54Ala (CDR H2 mutated) 
versions of germline CR6261 receptor IgM. Twenty-four hours after transfection, 
cells were exposed to either 0.5 1g tll’ mouse anti-human IgM F(ab’), (catalogue 
no. 9023-01, SouthernBiotech) in HBS as a positive control or HA proteolipo- 
somes (HA trimer diluted to 2.5 11M in HBS). After 15 min exposure at room 
temperature, cells were placed at 4 °C, washed two times with HBS and then lysed 
for 10 min in lysis buffer (Cell Lysis Buffer, Cell Signaling Technology) supple- 
mented with protease inhibitors (Complete Protease Inhibitor Cocktail, catalogue 
no. 13352700, Roche Applied Science). After SDS-PAGE, BCR activation was 
assessed by western blot analysis of the cell lysate: 4G10 pY (catalogue no. 05- 
0321, Millipore) reactivity to phosphotyrosine-SLP-65 (p65) and phosphotyrosine 
HS1 (p75) as described’**®*’. Total phosphotyrosine intensity from three inde- 
pendent experiments was measured by densitometry (Image Processing in Java 
(Image J) software with curve area density calculation performed in Microsoft 
Excel). Phosphotyrosine intensity was standardized to the level of cell lysate actin 
(monoclonal anti-B-actin, catalogue no. A5316, Sigma), with the background 
value for BCR activation being defined as the extent of stimulation in untrans- 
fected IgM-negative Ramos exposed to 0.5pgpl ' mouse anti-human IgM 
F(ab’)>. 

Crystallography. The Fab fragment of CR6261 sHgL was concentrated to 7 mg 
ml | in PBS and crystallized in a hanging drop over a reservoir containing 
100 mM imidazole pH 6.5, 17.5% polyethylene glycol 8,000, and 3% 2-methyl- 
2,4-pentanediol. Crystals were cryoprotected by transfer through paratone-N and 
vitrified in liquid nitrogen. Data to 2.85 A resolution were collected at Southeast 
Regional Collaborative Access Team (SER-CAT) 22-ID beamline at the Advanced 
Photon Source, Argonne National Laboratory. The structure was determined by 
molecular replacement using the program PHASER with the structure of CR6261 
bound to influenza HA (3GBN) as the search model, and refined using the pro- 
gram PHENIX (Supplementary Table 1). The asymmetric unit contains eight 
copies of the Fab. The CDR H1 was deleted from the model before refinement. 
An electron density map calculated before modelling the CDR H1 showed only 
weak density corresponding to this loop; however, an NCS-averaged map calcu- 
lated in COOT, which is shown in Fig. 2c, indicated clearly that the CDR H1 loop 
adopts a non-canonical conformation similar to that observed in the co-crystal 
structure with HA. 
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Genotoxic consequences of endogenous aldehydes 
on mouse haematopoietic stem cell function 


Juan I. Garaycoechea'*, Gerry P. Crossan'*, Frederic Langevin', Maria Daly', Mark J. Arends? & Ketan J. Patel! 


Haematopoietic stem cells (HSCs) regenerate blood cells through- 
out the lifespan of an organism. With age, the functional quality of 
HSCs declines, partly owing to the accumulation of damaged 
DNA‘ *. However, the factors that damage DNA and the protective 
mechanisms that operate in these cells are poorly understood. We 
have recently shown that the Fanconi anaemia DNA-repair path- 
way counteracts the genotoxic effects of reactive aldehydes**. Mice 
with combined inactivation of aldehyde catabolism (through 
Aldh2 knockout) and the Fanconi anaemia DNA-repair pathway 
(Fancd2 knockout) display developmental defects, a predisposition 
to leukaemia, and are susceptible to the toxic effects of ethanol—an 
exogenous source of acetaldehyde*. Here we report that aged 
Aldh2~’~ Fancd2~’~ mutant mice that do not develop leukaemia 
spontaneously develop aplastic anaemia, with the concomitant 
accumulation of damaged DNA within the haematopoietic stem 
and progenitor cell (HSPC) pool. Unexpectedly, we find that only 
HSPCs, and not more mature blood precursors, require Aldh2 
for protection against acetaldehyde toxicity. Additionally, the 
aldehyde-oxidizing activity of HSPCs, as measured by Aldefluor 
stain, is due to Aldh2 and correlates with this protection. Finally, 
there is more than a 600-fold reduction in the HSC pool of mice 
deficient in both Fanconi anaemia pathway-mediated DNA repair 
and acetaldehyde detoxification. Therefore, the emergence of bone 
matrow failure in Fanconi anaemia is probably due to aldehyde- 
mediated genotoxicity restricted to the HSPC pool. These findings 
identify a new link between endogenous reactive metabolites and 
DNA damage in HSCs, and define the protective mechanisms that 
counteract this threat. 

Reactive aldehydes, such as acetaldehyde, are by-products of meta- 
bolism. These molecules readily react with DNA and proteins within 
cells, leading to both mutagenesis and cell death*. Recent work has 
identified a critical requirement for the aldehyde-catabolising enzyme 
Aldh2 and the Fanconi anaemia DNA-repair pathway in protecting 
against damage caused by these toxic molecules. The majority of mice 
deficient in both acetaldehyde catabolism (Aldh2~/—) and the Fanconi 
anaemia DNA-repair pathway (Fancd2 ‘~), spontaneously succumb 
to acute T-cell leukaemia*. Flow cytometric analysis revealed that the 
maturation of haematopoietic lineages was intact (Supplementary 
Fig. 1). However, double-mutant mice, without leukaemia, consis- 
tently have reduced numbers of all blood constituents and bone 
marrow cellularity, compared to wild-type controls (Supplementary 
Fig. 2). Furthermore, a small proportion of these double-mutant mice 
developed anaemia with age (n = 6/29) (Fig. 1a). These mice showed 
profound pancytopenia, and histological analysis of the bone marrow 
revealed hypocellularity (Fig. 1b, c and Supplementary Fig. 3a). 
Furthermore, these mice showed extramedullary haematopoiesis 
(Supplementary Fig. 3b). These clinical and pathological features are 
consistent with aplastic anaemia—a hallmark of human Fanconi 
anaemia but, until now, not a feature observed in Fanconi anaemia 
knockout mice. Immunohistochemistry revealed that the bone 


marrow of the aged Aldh2-/~ Fancd2~/~ mice had increased numbers 
of cells that were positive for y-H2AX, a well-established marker of 
DNA double-strand breaks°. There was also an increased proportion 
of cells undergoing apoptosis, as measured by cleaved caspase-3 
(Fig. 1c). The data in Fig. 1d, e show that there was a significant 
induction in y-H2AX, as measured by flow cytometry, within the total 
bone marrow of aged aplastic double-mutant mice. However, y-H2AX 
induction within the Lineage (Lin) ~ c-Kit* Sca-1* (LKS) population, 
which is enriched for HSPCs, was much greater. Analysis of 
young Aldh2~’~ Fancd2~‘~ mice indicated that the induction in 
y-H2AX was greater in long-term HSCs (LT-HSCs; Lin” c-Kit* 
Sca-1* Flt3” CD34—) than in other haematopoietic progenitors (Sup- 
plementary Fig. 4). Taken together this suggests that there is a require- 
ment for acetaldehyde catabolism and Fanconi anaemia pathway- 
mediated DNA repair for blood homeostasis, and that this activity 
may be restricted to a subset of haematopoietic cells. 

We therefore set out to test systematically how haematopoietic cells 
at different stages of maturity respond to exogenous acetaldehyde— 
the key substrate of Aldh2. In the first instance we found that B cells, 
erythroid progenitor and granulocyte-macrophage progenitor cells 
from three different strains of mice deficient in the Fanconi anaemia 
DNA-repair pathway (Fanca ’~, see Supplementary Fig. 5; Fancd2/~, 
ref. 7;and Fancp ‘~ (Fancp is also known as S/x4); ref. 8) were all more 
sensitive to acetaldehyde than congenic controls (Fig. 2a). Surprisingly, 
when we tested the resistance of haematopoietic cells deficient in both 
Aldh2 and Fancd2, we noted that all three lineages tested were no more 
sensitive than those deficient in Fancd2 alone (Fig. 2b). In contrast, we 
have previously shown that Aldh2’~ Fancd2-’~ mice developed bone 
marrow failure after ethanol exposure, but that this did not occur in 
single-mutant mice*. We speculated that this was because aldehyde 
catabolism and the Fanconi anaemia repair pathway specifically pro- 
tected the HSC compartment, and therefore aldehyde accumulation 
would be most toxic to this vital cell population. Progressive attrition of 
the HSC pool is considered to be the reason why most Fanconi anaemia 
patients develop bone marrow failure’. 

To test this, we modified the classic colony-forming unit spleen 
(c.fu.-S) assay, which is used to quantify short-term HSCs (ST- 
HSCs), a subset of HSPCs'*"’. As shown in Fig. 2c, we exposed the 
bone marrow of mutant and wild-type mice to acetaldehyde in vitro, 
before transplantation into lethally irradiated recipients. Ten days after 
transplantation we killed the mice and counted the number of spleen 
colonies. This assay therefore allows us to measure directly the survival 
of ST-HSCs after exposure to acetaldehyde. The data in Fig. 2d show 
that ST-HSCs use both aldehyde detoxification and Fanconi anaemia 
pathway-mediated DNA repair to counteract the genotoxic effects of 
acetaldehyde. This is in contrast to the more mature haematopoietic 
cells, in which Aldh2 is dispensable for resistance to acetaldehyde. 

For many years it has been appreciated that potent aldehyde 
dehydrogenase activity is associated with certain normal and cancer 
stem cells'*'*. This activity can be quantified by the commercial assay 
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known as Aldefluor'®. Briefly, cells are incubated with a fluorescent 
aldehyde substrate that, when oxidized by aldehyde dehydrogenases, 
cannot efflux from the cell (Fig. 3a). The amount of fluorescence 
reflects the aldehyde dehydrogenase activity in that cell. The addition 
of 4-diethylaminobenzaldehyde (DEAB), which is an inhibitor of Aldh 
enzymes, prevents enzymatic oxidation and the accumulation of this 
product, therefore providing a negative control for fluorescence. In 
human bone marrow, the Aldefluor activity of HSCs is sufficient to 
enable their identification and isolation'®. However, the restriction of 
this aldehyde dehydrogenase activity to the HSC pool seems to be less 
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Figure 1 | Aged Aldh2~’~ Fancd2~’~ mice succumb to bone marrow 
failure. a, Kaplan-Meier curve showing the survival of a cohort of 
Aldh2~‘~ Fancd2~’~ mice and congenic controls. Red dots denote mice with 
bone marrow failure (n = 6/29). WT, wild type. b, Bar chart eee the 
number of nucleated cells per femur of aged Aldh2-/~ Fancd2~‘~ mice (n = 6) 
with bone marrow failure and controls. Error bars represent s.e.m. ¢, Peripheral 
blood film (400), bone marrow histology stained with haematoxylin and 
eosin (H&E; X50) and immunohistochemistry for cleaved caspase-3 or 
y-H2AX (X200, inset 400) from an Aldh2~’~ Fancd2-/~ mouse with bone 
marrow failure and controls. d, e, Flow cytometric analysis of y-H2AX in total 
bone marrow and LKS populations with quantification of the relative induction 
in Aldh2-/~ Fancd27/~ (error bars represent s.e.m., n = 2, *P < 0.05, 

**P < 0.01, ***P < 0.001). 


strict in mouse than in man’””*’. The identity of the enzyme responsible 
for Aldefluor activity was unknown, although it has recently been 
shown that it is not due to Aldh1al (ref. 17). In contrast, we find that 
the Aldefluor activity is greatly abrogated in Aldh2 ’~ mice. There is a 
marked reduction in the Aldefluor activity of total bone marrow and of 
the HSPC pool, with an intermediate reduction in Aldh2‘’” mice 
(Fig. 3b, c). Furthermore, Aldh2 is the most highly expressed member 
of the aldehyde dehydrogenase family in the LT-HSC population’””. 
These data suggest that Aldh2 is the major source of Aldefluor activity 
and acetaldehyde detoxification within the HSPC pool. 

Having established that Aldh2 and Fancd2 protect HSPCs against 
acetaldehyde-mediated genotoxicity, we set out to establish the direct 
physiological consequences of disruption of both genes on HSPC 
function in young 8-12-week-old mice. First we used immunopheno- 
typing to determine the frequency of HSPCs within the bone marrow 
of young double-mutant mice that had not developed leukaemia or 
bone marrow failure”. The data in Fig. 4a show a 30-fold reduction 
in the LKS population, in contrast to the mild twofold reduction in 
Fancd2~'~ mice. Further analysis of this population revealed that 
there was a tenfold reduction in the frequency of LT-HSCs 
(Lin™ c-Kit* Sca-1* Flt3~ CD34_) in Aldh2~/~ Fancd2~/~ mice com- 
pared to wild-type littermates, with Fancd2’~ or Aldh2~’~ mice 
again showing only a mild reduction (Fig. 4b and Supplementary 
Fig. 6a). Similarly, bone marrow from Aldh2~/~ Fancd2-‘~ mice 
showed a significant reduction in the frequency of HSCs defined by 
SLAM markers (Lin CD41 CD48” CD150°)!. However, when 
SLAM markers were combined with canonical HSC surface markers 
(SLAM + LKS: Lin” CD41” CD48~ CD150* c-Kit* Sca-1*), a more 
pronounced 251-fold reduction in the frequency of HSCs was 
observed in Aldh2-’~ Fancd2-’~ mice (Supplementary Fig. 6b-d). 

Analysis of cell cycle dynamics revealed an accumulation in the 
S-G2-M phase of the cell cycle in double-mutant LT-HSCs. 
Furthermore, we noticed a marked reduction in the number of 
quiescent (Go) LT-HSCs in Aldh2-’~ Fancd2~’~ mice (P< 0.0001). 
The majority of LT-HSCs were actively cycling, in contrast to wild- 
type LT-HSCs, which are mainly quiescent (Fig. 4c). It has previously 
been demonstrated that damage to the bone marrow compartment 
leads to a loss of quiescence and entry of HSCs into the cell cycle”"?**. 

Finally, we functionally tested the HSPC pool of young 
Aldh2-’" Fancd2-‘~ mice. Bone marrow obtained from 
Aldh2-’~ Fancd2~‘~ mice and congenic controls was transplanted 
into lethally irradiated recipients. These recipients were then killed 
at day 10 and the number of spleen colony forming units (c.f.u.-S;9) 
was quantified. The data in Fig. 4d and Supplementary Fig. 7a clearly 
show that the number of c.f.u.-S;9 was greatly reduced (28-fold) in 
mice reconstituted with Aldh2 ’~ Fancd2-’~ marrow compared with 
wild type, in contrast to the mild reductions in the single mutants 
(Aldh2 '~, 1.8-fold; Fand2-’~, 3.2-fold compared with wild type). 
The in vitro cobblestone-area-forming cell (CAFC) assay corroborates 
this decreased frequency of HSPCs in Aldh2~’~ Fancd2~’~ marrow 
(Supplementary Fig. 7b). However, these assays have limitations 
because we cannot draw any conclusions about the long-term repopu- 
lating potential of the HSCs. To assess this function, we carried out the 
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Figure 2| Aldh2 is critical for protecting 
Fancd2'’" ST-HSCs from exogenous 
acetaldehyde. a-d, Survival of B cells, eyrthroid 
(c.fu.-E) and granulocyte-macrophage (c.f.u.- 
GM) progenitors following exposure to 
acetaldehyde in vitro. a, b, Cells were obtained 
from wild-type (WT), Fanca ’, Fancd2-’~ and 
Slx4-/~ mice (a) or wild-type, Aldh2~’~, 
Fancd2-/~ and Aldh2~/~ Fancd2~/~ mice 

(b). Each data point represents the mean of two 
independent experiments, each carried out in 
duplicate, error bars represent s.e.m. c, Scheme 
outlining the assay to determine survival of ST- 
HSCs (c.f.u.-Sj0) after exposure to acetaldehyde. 
d, c.f.u.-S;9 survival was made relative to the 
untreated controls of each genotype. Each data 
point represents the mean c.f.u.-S;o survival among 
eight recipient mice. Error bars represent s.e.m. 


Figure 3 | Aldh2 is responsible for Aldefluor 
activity in murine HSPCs. a, Scheme outlining 
the Aldefluor assay. The substrate is oxidized by 
Aldh enzymes and the charged fluorescent product 
accumulates. DEAB, a general inhibitor of Aldh 
enzymes, was used as a control for background 
fluorescence. b, Flow cytometric analysis of the 
lineage-negative, LKS and long-term HSC (LT- 
HSC) populations of Aldh2*/~ and Aldh2~‘~ bone 
marrow treated with the Aldefluor reagent with 
and without DEAB. c, Quantification of Aldefluor 
fluorescence in bone marrow subpopulations from 
Aldh2*’*, Aldh2*’~ and Aldh2~’~ mice. The data 
were made relative to DEAB control and represents 
the mean of three independent experiments. Error 
bars represent s.e.m., n = 10, *P < 0.05, 

**P < 0.01, ***P < 0.001 and ****P < 0,0001 
compared with Aldh2*’* control for each 
population. 
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Figure 4 | The HSC pool of young 

Aldh2~’~ Fancd2~‘~ mice is severely 
compromised. a, Representative FACS profiles of 
HPSCs of 8-12-week-old Aldh2~/~ Fancd2~/~ 
mice, showing LKS, LT-HSC, ST-HSC and 
multipotent progenitor (MPP) populations. WT, 
wild type. b, Quantification of HPSC populations 
assessed by FACS. *P < 0.05, error bars represent 
s.e.m., n = 3 per genotype. c, Cell cycle analysis of 
the LT-HSCs in Aldh2 ’~ Fancd2‘~ mice and 
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and 4’,6-diamidino-2-phenylindole (DAPI) 
staining. d, Frequency of c.f-u.-Sj9 in the bone 
marrow of Aldh2-’~ Fancd2 ‘~ and control mice. 
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per genotype, central line represents the mean, 
error bars represent the s.e.m., ***P < 0.001. 
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long-term competitive repopulation assay. This assay allows the fre- 
quency of LT-HSCs within mutant bone marrow to be calculated 
(defined as competitive repopulation units, CRU) and is based on their 
functional ability to reconstitute all blood lineages in lethally irradiated 
mice”. The data in Fig. 4e and Supplementary Fig. 7c reveal that 
double-mutant marrow is profoundly compromised, showing a 638- 
fold reduction in the frequency of CRU compared with wild type 
(P<1X 10 ”°). It is also worth noting that the long-term repopulat- 
ing ability in Aldh2-’~ bone marrow was reduced 2.75-fold 
(P = 0.0156), demonstrating the functional importance of aldehyde 
dehydrogenase activity associated with the HSC compartment. 
Furthermore, there was a 38-fold reduction in the HSC pool of 
Fancd2~/~ mice compared to wild type (P= 5 X 10‘). It is notable 
that the combined inactivation of both acetaldehyde catabolism and 
the Fanconi anaemia DNA-repair pathway leads to a synergistic func- 
tional HSC defect. 

The data presented here reveal that HSPCs use two parallel mechan- 
isms for protection from the irreversible genetic damage caused by 
reactive aldehydes. Recently, numerous studies have identified critical 
roles for a variety of DNA-repair pathways in HSC homeostasis, with 
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defective DNA repair leading to age-dependent dysfunction of these 
cells in mice’*’®’”, We show that the combined inactivation of 
aldehyde detoxification and the Fanconi anaemia DNA-repair path- 
way results in persistent DNA damage in HSCs. This leads to the loss of 
a functional HSC pool, and precipitates spontaneous bone marrow 
failure. Children with Fanconi anaemia often succumb to bone 
marrow failure and leukaemia, in both instances due to genetic damage 
to the stem cells’”*. This study also provides the first evidence, to our 
knowledge, about what drives the Fanconi anaemia haematological 
phenotype, and has the potential to inform future therapeutic inter- 
ventions for this lethal human illness. Perhaps the most important 
implication of this work is the revelation that naturally produced 
aldehydes can be potently genotoxic to HSCs, and without the 
necessary protection, these stem cells malfunction or die, the two 
key features that typify a process of accelerated ageing. 


METHODS SUMMARY 

Mice. Aldh2~/~ Fancd2~’~ mice were described previously‘. All animals were 
maintained in specific pathogen-free conditions. In individual experiments, all 
mice were matched for age and gender. All animal experiments undertaken in this 
study were done so with the approval of the UK Home Office. 
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Haematopoietic progenitor. Survival of lineage-restricted progenitors was 
assessed using methylcellusose-based media, and survival of short-term HSCs 
was assessed using the c.fu.-S assay, as described previously'°'’. Total bone 
marrow was treated with acetaldehyde in vitro for 4h before plating in 
methylcellulose-based media or injection into lethally irradiated recipients (see 
Methods). 

Aldefluor stain. The Aldefluor stain was performed according to the manufac- 
turer’s instructions, using bone marrow cells pre-stained for surface markers to 
define HSCs, as described in Methods. Aldefluor fluorescence was quantified using 
the geometric mean. 

Limiting dilution assay. The limiting dilution assay was performed essentially as 
described previously'®. Varying amounts of male ‘test’ bone marrow (8 X 10°, 
4x 104, 2X 10°, 1 X 10°, 5 X 10° and 2.5 X 10” cells) were mixed with a fixed 
amount (2 X 10°) of wild-type female bone marrow and injected into lethally 
irradiated (900 Gy, split between two doses) female F; C57B6/Jola X 129/SV 
recipients. Twelve recipients were used for each dose of test bone marrow. After 
16 weeks, mice were assessed for multi-lineage reconstitution (see Methods). 
Statistical analysis. Unless otherwise stated, data reflect the mean + s.e.m., anda 
two-tailed Student’s t-test was used to assess the statistical significance (*P < 0.05, 
**P < 0.01, ***P < 0.001, ****P < 0.0001). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Mice. Aldh2-/~ Fancd2-’~ mice on a C57/B6Jola X 129SV hybrid background 
were described previously‘. Fancd2-deficient mice (Fancd2""""°", Mouse Genome 
Informatics (MGI) code: 2673422, 129S4/SvJae) were a gift from M. Grompe. 
Aldh2-deficient mice were generated from embryonic stem (ES) cells obtained 
from EUCOMM (Aldh2?"@@0VCOMMW!. IGT code: 4431566, C57BL/6N). Mice 
carrying both the Fancd2’""™ and Aldh2'"'¢@@UCOMMW'si synthetic alleles were 
generated by crossing Fancd2'”""'" and Aldh2'™'¢@00OMMW's! heterozygous 
mice. These progeny were subsequently intercrossed to obtain double-mutant 
mice and control genotypes as described previously*. Fanca-deficient mice were 
generated from ES cells, carrying a targeted disruption of the Fanca locus, 
obtained from the EUCOMM consortium (Supplementary Fig. 5b). Targeting 
was confirmed by long-range PCR using the following oligonucleotides. LAR3: 
CACAACGGGTTCTTCTGTTAGTCC; GF4: GCTAACACTAGTAAGAGTC 
ACAATAATCTC; RAF5; CACACCTCCCCCTGAACCTGAAAC; and GR3: 
CTTGTTGGTGGTTGGATATCTTGATGGTTG (Supplementary Fig. 5b). 
Germline transmission was achieved and progeny were genotyped using the fol- 
lowing oligonucletotides. FL033: GCCTTTGCTGCTCTAATTCCATGT; FL040: 
TCAGCTCACTGAGACGCAACCTTTTACACT; and En2A: GCTTCACTGA 
GTCTCTGGCATCTC (Supplementary Fig. 5c). Furthermore, cells derived from 
Fanca-deficient mice were unable to mono-ubiquitinate Fancd2 after damage with 
MMC (Supplementary Fig. 5d). All animals were maintained in specific pathogen- 
free conditions. In individual experiments all mice were matched for age and 
gender. All animal experiments undertaken in this study were done so with the 
approval of the UK Home Office. 

Aldefluor stain. Aldefluor stain was performed according to the manufacturer’s 
instructions, using bone marrow cells pre-stained for surface markers to define 
HSPCs as described below. Fluoresence was quantified using the geometric mean. 
Limiting dilution assay. The limiting dilution assay was performed essentially as 
described previously’. Varying amounts of male ‘test’ bone marrow (8 X 10°, 
4X 10*, 2X 10°, 1 X 10°, 5 X 10° and 2.5 X 10’ cells) were mixed with a fixed 
amount (2 X 10°) of wild-type female bone marrow and injected into lethally 
irradiated (900 Gy, split between two doses) female F; C57B6/Jola X 129/SV 
recipients. Twelve recipients were used for each dose of test bone marrow. After 
16 weeks, these recipients were killed and peripheral blood was fractionated by 
FACS into T cells (CD4" or CD8", B220 Gr-1 Mac-1"), B cells 
(B220* CD4~” CD8 Gr-1” Mac-1") and myeloid cells (Gr-1 * or Mac-1*, 
B220 CD4 CD8 ) using surface markers described previously”. The relative 
contribution of the test and wild-type bone marrow to peripheral blood 
chimaerism was then assessed using quantitative PCR (qPCR) for the Y 
chromosome with oligonucleotides described previously*®. Recipients were con- 
sidered reconstituted with test bone marrow if all three lineages showed =1% 
chimaerism for the test bone marrow. The frequency of non-reconstituted mice 
is plotted against the dose of test cells injected. The frequency of CRU was 
calculated using L-Calc and P values were calculated using ELDA software at 
http://bioinf.wehi.edu.au/software/elda. 

Statistical analysis. Unless otherwise stated, data reflect the mean + s.e.m., anda 
two-tailed Student’s t-test was used to assess the statistical significance (*P < 0.05, 
**P < 0.01, ***P < 0.001, ****P < 0.0001). 

Histological analysis. Histological analysis was performed on tissues that had 
been fixed in neutral buffered formalin for 24h. The samples were then paraffin 
embedded and 4m sections were cut before staining with haematoxylin and 
eosin. Immohistochemistry was performed as described previously”’, using rabbit 
anti-phospho-histone H2AX (Cell Signaling 2577; 1:50) and rabbit anti-cleaved 
caspase-3 (Cell Signaling Asp175 96611; 1:100). 

Peripheral blood. Peripheral blood was collected from mice at 8-12 weeks of age 
or from aged mice as indicated. Whole blood (50-100 il) was collected in ETDA 
microvette tubes (Startedt) and analysed on a VetABC analyser (Horiba). 
y-H2AX staining. y-H2AX staining by flow cytometry was performed using the 
total bone marrow of aged mutant mice and appropriate controls. Cells were 
resuspended at a concentration of 3 X 10° cells per 100 pl. The cells were stained 
for surface markers using biotin lineage cocktail (Miltenyi Biotec), anti-c-Kit 
(PerCP-Cy5.5), and anti-Sca-1 (PE-Cy7). The samples were incubated for 
15 min at 4°C in the dark. Anti-Flt3 (PE) and anti-CD34 (eFluor660) were also 
added to the stain of lineage-depleted bone marrow of young mice, and these 
samples were incubated for 75 min at 4 °C in the dark. Brilliant violet-421 strep- 
tavidin was added and the cells incubated for a further 15 min. Cells were fixed and 
permeabilized using Intraprep (Beckman Coulter) following the manufacturer’s 
instructions. The cells were then stained for 15 min with a FITC-conjugated anti- 
y-H2AX antibody (JBW301, Upstate). 

Survival assays of primary mouse B cells. Survival assays of primary mouse B 
cells were performed with lymphocytes purified from the spleen using Lympholyte 
M (Cederlane). Lymphocytes were stimulated with LPS (Sigma L4391) at a final 


concentration of 40 1g ml '. A total of 4 X 10° cells were plated with acetaldehyde 
in one well of a 24-well plate. After 7 days the viable cells were enumerated by 
trypan blue exclusion, counting 100 images using a ViCell XR (Beckman Coulter). 
Each data point represents the mean of three independent experiments, each 
carried out in triplicate. 

Methylcellulose c.f.u. survival assays. Methylcellulose c.f-u. survival assays were 
carried out as described previously'°’. Briefly, total bone marrow was flushed in 
IMDM (GIBCO) from the femora and tibiae of mutant mice and controls. The 
number of nucleated cells was enumerated using a solution of 3% acetic acid and 
methylene blue, and counted using a ViCell XR Cell counter. Equal numbers of 
total bone marrow cells were then exposed to various concentrations of acetalde- 
hyde in vitro for 4h in a sealed CryoVial. After treatment, two tenfold serial 
dilutions of the bone marrow were made, and these cells were plated into 6-well 
plates with methylcellulose-based media containing cytokines. 

For c.f.u.-GM, cells were plated at final concentrations of 2 X 10°, 2 X 10° and 
2X 10* cells per well in MethoCult M3534 (StemCell Technologies). Colonies 
were counted 7 days after incubation at 37 °C and 5% COs. For c.f.u.-E, cells were 
treated in the same manner, but were plated into MethoCult M3334 (StemCell 
Technologies) and colonies were counted after 2 days. In all cases survival was 
made relative to the untreated control for each genotype. Each point represents the 
mean of two independent experiments, each carried out in duplicate. 
c.f.u.-S assays. c.f.u.-S assays were performed as described previously'*"'*. To 
assess the frequency of c.f.u.-S in mutant mice, total bone marrow was flushed 
from the femora and tibiae of mutant mice and appropriate controls. Nucleated 
cells were enumerated using a solution of 3% acetic acid and methylene blue. The 
mutant bone marrow was then injected intravenously into 20 recipient irradiated 
mice that had been lethally irradiated. 1 X 10° nucleated bone marrow cells were 
used for wild-type, Aldh2-’~ and Fancd2-‘~ bone marrow, and 5 X 10° cells for 
Aldh2~’~ Fancd2-/~ bone marrow. After 10 days the spleens were fixed in Bouin’s 
solution (Sigma), the number of colonies were counted and made relative to the 
number of total bone marrow cells injected. 

To assess the survival of c.f-u.-S after exposure to acetaldehyde, we treated total 
bone marrow cells with various acetaldehyde concentrations for 4h in vitro before 
injecting them into lethally irradiated recipient mice. After 10 days, the number of 
c.f.u.-S were counted. The survival was made relative to the untreated control for 
each genotype. Each data point represents the mean c.f.u.-S survival in eight 
recipient mice. 

Irradiation of mice. Irradiation of mice was performed using a Cs-137 GSR Clm 
blood irradiator (Gamma-Service Recycling Gmbh; Germany). Mice received a 
dose of 900 Gy of total body irradiation, split between two equal doses, separated 
by 4h. Mice received prophylactic enrofloxacin (Baytril, Bayer) in the drinking 
water for 7 days before irradiation and for 5 weeks after irradiation. 

Flow cytometry. Flow cytometry was performed on bone marrow cells that were 
isolated from the femora and tibiae of mutant mice and appropriate controls by 
flushing cells and passing them through a 70-.m filter. The following antibodies 
were used to stain for HSCs: FITC-conjugated lineage cocktail with antibodies 
anti-CD4 (clone H129.19, BD Pharmingen), CD3e (clone 145-2C11, eBioscience), 
Ly-6G/Gr-1 (clone RB6-8C5, eBioscience), CD11b/Mac-1 (clone M1/70, BD 
Pharmingen), CD45R/B220 (clone RA3-6B2, BD Pharmingen), Fce Rlo (clone 
MAR-1, eBioscience), CD8a (clone 53-6.7, BD Pharmingen), CD1 1c (clone N418, 
eBioscience) and TER-119 (clone Terl119, BD Pharmingen), anti-c-Kit (PerCP- 
Cy5.5, clone 2B8, eBioscience), anti-Sca-1 (PE-Cy7, clone D7, eBioscience), anti- 
Flt3 (PE, clone A2F10, eBioscience) and anti-CD34 (eFluor660, clone RAM34, 
eBioscience). When staining for SLAM markers the same lineage cocktail was used 
(FITC) with the addition of the following antibodies: anti-CD48 (FITC, clone 
HM48-1, BioLegend), anti-CD41 (FITC, clone MWReg30, BD Pharmigen), 
anti-CD150 (APC, clone TC15-12F12.2, BioLegend) and anti-c-Kit and Sca-1 as 
above. Maturation of B cells was assessed using anti-CD45R/B220 (PE, clone RA3- 
6B2, BD Pharmingen) and anti-IgM (APC, clone II/41, BD Pharmingen). The 
maturation of the erythroid lineage was analysed using antibodies anti- TER-119 
(APC, clone Ter-119, BD Pharmingen) and anti-CD71 (PE, clone C2, BD 
Pharmingen). Granulocyte-macrophage maturation was assessed with antibodies 
anti-CD11b/Mac-1 (APC, clone M1/70, BD Pharmingen) and anti-Ly-6G/Gr-1 
(PE, clone RB6-8C5, eBioscience). Thymic T-cell maturation was assessed using 
CD4 (FITC, clone H129.19, BD Pharmingen) and CD8a (PE, clone 53-6.7, BD 
Pharmingen) antibodies. For the FACS sorting of peripheral white blood cells the 
following antibodies were used: anti-CD4 (FITC, clone H129.19, BD Pharmingen), 
anti-CD8a (FITC, clone 53-6.7, BD Pharmingen), anti-CD45R/B220 (APC, clone 
RA3-6B2, BD Pharmingen), anti-CD11b/Mac-1 (PE, clone M1/70, BD 
Pharmingen) and anti-Ly-6G/Gr-1 (PE, clone 1A8, BD Pharmingen). The samples 
were incubated for 15 min at 4 °C in the dark with the exception of samples contain- 
ing anti-CD34 (RAM34), which were incubated for 90 min. Samples were run on a 
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LSRII flow cytometer (BD Pharmingen) and the data were analysed with FlowJo 
9.3.1 (Tree Star). 

Y chromosome qPCR. Y chromosome qPCR was performed on genomic DNA 
from total peripheral blood or FACS-sorted peripheral blood cells (T cells 


(B220. Gr-1. Mac-1. CD4* or CD8"*), B cells (Gr-1 Mac-1_ CD4 CD8~ 


B220*) or myeloid cells (CD4  CD8™ B220- Gr-1~ or Mac-1*)) using 
QIAamp DNA micro kit (Qiagen). qPCR was performed using SYBR GreenER 
qPCR Supermix (Invitrogen). The level of male chimaerism was calculated using 
oligonucleotides to amplify the genomic regions of the Tspy and B-actin genes and 
the AAC; method”. qPCR was performed in triplicate using the following con- 
ditions: 50 °C for 2 min and 95 °C for 10 min, followed by 50 cycles of 95 °C for 15 s, 
and 60 °C for 1 min, on a 7900HT Fast Real-Time system (Applied Biosystems). 
Ki-67 staining. Ki-67 staining was performed on total bone marrow that was 
lineage depleted using a magnetic bead lineage depletion kit (Miltenyi Biotec). The 
cells were then stained for surface markers: biotin lineage cocktail (Miltenyi 
Biotec), streptavidin (APC-Cy7), c-Kit (PerCP-Cy5.5), Sca-1 (PE-Cy7), CD34 
(eFluor660) and Flt3 (PE). The cells were fixed and permeabilized using the 
Cytofix/Cytoperm kit (BD Pharmigen) following the manufacturer’s instructions. 
The cells were then stained overnight at 4 °C with FITC mouse anti-human Ki-67 
(B56, BD Pharmigen). The samples were washed and resuspended in PBS plus 
3% FCS with DAPI, at a final concentration of 3 1M, and analysed immediately. 
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CAFC assay. The CAFC assay was performed on a layer of freshly established 
wild-type murine stromal cells, as described previously**. The stromal layer was 
cultured at 33 °C and 5% CO, until confluency was reached and was then inacti- 
vated by irradiation with 15 Gy. Single-cell suspensions of whole bone marrow 
were prepared at various concentrations and seeded on the stromal layer in 96-well 
plates with MyeloCult Media (StemCell Technologies) as described previously**”’. 
The plates were incubated at 33 °C and 5% CO, for 21 days, and colonies scored as 
described previously*’. The CAFC frequency was calculated based on the Poisson 
distribution using L-Calc software (StemCell Technologies). 
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The Fun30 nucleosome remodeller promotes 
resection of DNA double-strand break ends 
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Chromosomal double-strand breaks (DSBs) are resected by 5’ 
nucleases to form 3’ single-stranded DNA substrates for binding 
by homologous recombination and DNA damage checkpoint 
proteins. Two redundant pathways of extensive resection have 
been described both in cells'* and in vitro**®, one relying on 
Exol exonuclease and the other on Sgs1 helicase and Dna2 nucle- 
ase. However, it remains unknown how resection proceeds within 
the context of chromatin, where histones and histone-bound 
proteins represent barriers for resection enzymes. Here we identify 
the yeast nucleosome-remodelling enzyme Fun30 as a factor 
promoting DSB end resection. Fun30 is the major nucleosome 
remodeller promoting extensive Exol- and Sgs1-dependent resection 
of DSBs. The RSC and INO80 chromatin-remodelling complexes 
and Fun30 have redundant roles in resection adjacent to DSB ends. 
ATPase and helicase domains of Fun30, which are needed for 
nucleosome remodelling’, are also required for resection. Fun30 
is robustly recruited to DNA breaks and spreads along the DSB 
coincident with resection. Fun30 becomes less important for resection 
in the absence of the histone-bound Rad9 checkpoint adaptor protein 
known to block 5’ strand processing® and in the absence of either 
histone H3 K79 methylation or y-H2A, which mediate recruitment 
of Rad9 (refs 9, 10). Together these data suggest that Fun30 helps to 
overcome the inhibitory effect of Rad9 on DNA resection. 

To identify novel proteins that are important in homologous recom- 
bination, we screened a pool of 4,836 homozygous diploid yeast dele- 
tion mutants" for those with altered rates of integration of a URA3 
cassette at two independent loci (ISU1 and SOD2) (Fig. 1a). These 
screens identified most of the genes known to be essential in homo- 
logous recombination, genes that are known to suppress gene target- 
ing, such as SGS1 and EXO1, as well as several novel genes (Fig. 1b and 
Supplementary Table 1). Higher rates of gene integration in the sgs1A 
and exolA mutants are probably attributable to increased stability of 
the transformed linear DNA fragments’? because Sgs1 helicase and 
Exol nuclease enzymes are involved in the degradation of 5’ strands at 
DSB ends**. The Mph1 helicase that was also identified in the screen 
may inhibit gene targeting by unwinding early recombination inter- 
mediates'*. Thus, degradation of a transformed integration cassette by 
resection enzymes and disruption of early recombination intermedi- 
ates by DNA helicases probably prevents efficient integration in cells. 
In addition to the sgs1A, exo1A and mph1A mutants, gene targeting 
was significantly increased in the fun30A mutant (Fig. 1b), suggesting 
that Fun30 may also be involved either in DSB end resection or in 
processing recombination intermediates. Fun30 is a member of the 
well conserved yet poorly characterized Etll subfamily of Snf2 nucleo- 
some-remodelling factors'*, acts as a homodimeric ATP-dependent 
nucleosome-remodelling enzyme’ and has been implicated in gene 
silencing’*"*. 

To assess a possible role for Fun30 in resection, we deleted FUN30 
in a haploid strain where a DSB was induced by HO (homothallic 


switching endonuclease) at the MAT locus and the HML and HMR 
homologous sequences were deleted to prevent repair by homologous 
recombination. Southern blot analysis with multiple probes on both 
sides of the break allowed us to follow the loss of restriction fragments 
as a measure of 5’ strand resection at varying distances from the DSB’. 
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Figure 1 | A genome-wide screen identifies novel genes that regulate gene 
integration in yeast. a, A barcode microarray-based screen for mutants with 
altered frequency of gene integration. A mixed population of 4,836 
homozygous diploid deletion mutant strains was transformed with an 
integrating cassette (that is, isu1A::URA3) or a centromeric plasmid (control). 
Molecular barcodes representing the deletion mutants were PCR-amplified 
from genomic DNA samples isolated from both transformed pools using Cy5- 
or Cy3-labelled primers and co-hybridized to a barcode microarray to reveal 
mutants with altered gene integration efficiencies. b, Integration frequency in 
selected mutants. WT, wild type; error bars, s.d. (n = 3). 
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Deletion of FUN30 only mildly affected the initial resection next to the 
DSB but severely delayed resection at regions 5, 10 and 27-28 kilobases 
(kb) from the break site (Fig. 2A). Consistently, fun30A mutants failed 
to repair DSBs by single-strand annealing’, a mechanism of DSB 
repair between two direct repeats that requires extensive resection 
followed by annealing of the repeats (Fig. 2B). In contrast, the mutants 
deficient in INO80, SWR1 or RSC, which were previously implicated 
in resection'*”, were proficient in single-strand annealing, suggesting 
that Fun30 has a unique function in extensive resection (Fig. 2B). 
Consistent with defects in resection in fun30A mutants, loading of 
the single-stranded DNA binding protein RPA, and the Rad51 recom- 
binase, was moderately decreased within 1 kb of the DSB but greatly 
diminished 5 kb from the break site (Fig. 2C). 

To test whether Fun30 affects processing of homologous recom- 
bination intermediates, we tested crossover frequency using an ectopic 
recombination assay involving MATa and MATa-inc sequences 
located on chromosomes V and III, respectively*’. Crossover and 
non-crossover products can be distinguished on the basis of restriction 
fragment size (Supplementary Fig. 1). In this assay, the fun30A mutant 
showed a repair efficiency comparable to that of wild-type cells and to 
other single mutants that affect extensive resection’. However, a two- 
fold increase in crossover frequency was found in fun30A cells but not 
in mutants of the other tested remodellers (Supplementary Fig. 1). This 
is the first evidence linking a nucleosome-remodelling enzyme to 
recombination pathway choice. In this study, we focused on the role 
of Fun30 in resection. 

There are two pathways of extensive resection in yeast, one that 
depends on Exol and one that depends on Sgs1 (refs 1-3). To test 
whether Fun30 works in either one or both pathways, we compared 
resection in exo1A, sgs1A and fun30A single mutants with that in 
sgsIA fun30A and exolAfun30A double mutants. Both double 
mutants exhibited more severe resection defects and DNA damage 
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sensitivities than any of the single mutants (Supplementary Fig. 2), 
and the triple mutant sgs1A exo1A fun30A had the same pattern of 
resection as sgs1A exo1A double-mutant cells (Supplementary Fig. 2 
and ref. 3). Therefore, we conclude that Fun30 contributes to both the 
Exol and the Sgs1 resection pathways. 

Fun30 probably regulates resection directly, because we observed 
robust Fun30 recruitment to DSBs which spread in both directions 
along the DSB coincident with resection (Fig. 3a). Also, Fun30 co- 
immunoprecipitates with other resection proteins including Exol, 
Dna2 and RPA (Supplementary Fig. 4). Further evidence comes from 
the observation that in fun30A cells resection enzymes were recruited 
next to DSB ends but failed to spread efficiently farther from the break 
site (Fig. 3b), suggesting that Fun30 is important for the progression of 
resection. Finally, both helicase and ATPase domains needed for 
nucleosome remodelling by Fun30 (ref. 7) are also required for effi- 
cient resection (Fig. 3c and Supplementary Fig. 5). It is unlikely that 
Fun30 has an indirect role in resection via changing the general 
chromatin structure, because the patterns of nucleosome positioning 
at multiple loci around an HO cut site in fun30A and the wild-type cells 
are comparable (Supplementary Fig. 3). Also, microarray analysis of 
genome-wide gene expression showed that FUN30 deletion cause only 
modest changes in expression of 14 genes unrelated to DNA damage 
response’”'® (Supplementary Table 2), and cellular levels of proteins 
involved in resection were comparable in wild-type and fun30A cells 
(Supplementary Fig. 3). 

Chromatin remodelling at DSBs could occur independently of 
resection or in a way coupled with resection. The second possibility 
is more likely because recruitment of Fun30 to DSBs is severely 
impaired in mutants defective in resection, such as sgs1A exolA and 
mre11A, and in the absence of active Cdk1 kinase**”* (Fig. 3d). This 
defect is unlikely to be due to poor checkpoint activation because 
Fun30 was efficiently recruited in checkpoint-deficient cells 
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Figure 2 | fun30A mutants are deficient in resection. A, a, Diagram of the 
MAT locus and probes used to follow resection kinetics. b, c, Southern blot 
analysis (b) and kinetics (c) of DSB end resection in wild-type and fun30A 
mutants. Plotted in this and all following graphs are mean values of three 
independent experiments with error bars denoting s.d. B, a, Schematic 


representation of single-strand annealing assay’” (SSA) b, c, Southern blot analysis 
of SSA kinetics (b) and cell viability in response to a DSB repaired by SSA 

(c). C, The relative enrichment of Rad51 and RPA at DSBs in wild-type and 
fun30A cells analysed by chromatin immunoprecipitation (ChIP). 
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Figure 3 | Fun30 is directly involved in DSB end resection. a, ChIP analysis 
of Fun30-13 Myc recruitment at a DSB. b, Recruitment of resection enzymes 
analysed by ChIP. Statistically significant differences in protein enrichment are 
indicated: *P < 0.05, **P < 0.01, t-test. Relative enrichment in fun30A is 

shown. c, Analysis of resection kinetics in indicated FUN30 mutants. The levels 


(mec1A tel1A sml1A), albeit with a delay (Fig. 3d). Together with the 
observation that Fun30 spreads along the DSB coincident with 
resection and is required for efficient spreading of resection enzymes 
away from the DSB, we reason that Fun30-mediated chromatin 
remodelling and resection appear in a coupled manner. Consistent 
with this view, there is no detectable change in histone H3 occupancy 
before resection as monitored by histone H3 chromatin immunopre- 
cipitation. We note that histones remain bound to DSB ends longer in 
fun30A or sgs1A exo1A mutants, but this can be attributed to slower 
resection (Supplementary Fig. 6 and ref. 24). 

Previous studies indicated an important role for the INO80 (ref. 20), 
RSC” and SWRI (ref. 18) ATP-dependent nucleosome-remodelling 
complexes in resection. To examine possible redundant activities in 
resection, we created single mutants lacking individual components of 
these and other complexes and the corresponding double mutants 
additionally lacking Fun30. No significant resection defect was 
observed in mutants deficient in CHD1, ISW1a/b or SWR1 activity 
alone, and in the double mutants the defect was comparable to that in 
the fun30A single mutant (Supplementary Figs 7 and 8). Also, Rad54, a 
Snf2-like protein’’, has no role in resection. However, the damage 
sensitivity of rad54A is epistatic with fun30A, providing further evid- 
ence for Fun30’s role in the homologous recombination pathway 
(Supplementary Fig. 9). A very mild defect was observed in arp8A 
(INO80 complex) mutant cells, and in rsc2A mutant cells initial resec- 
tion was significantly delayed, a phenotype similar to but weaker than 
that observed in MRX mutants’. The arp8A fun30A and rsc2A fun30A 
double mutants showed more dramatic defects in resection and DNA 
damage response than the single mutants (Supplementary Figs 7 and 
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of mutant proteins and their recruitment to DSBs were comparable to wild-type 
Fun30 (Supplementary Fig. 5). d, ChIP analysis of Fun30-13* Myc 
recruitment in mutants deficient in resection or DNA damage checkpoint. 
Recruitment at the ARO] locus was used asa control. Au error bars, s.d. (n = 3). 


8). Elimination of all three remodelling factors impaired resection and 
checkpoint activation very severely (Fig. 4A). Taken together, these 
results indicate that Fun30 is the most important remodeller promot- 
ing extensive resection, and that initial resection close to DSBs is 
stimulated by RSC and to lesser extent by Fun30 and INO80. 

Along with tight histone-DNA interactions, the histone-bound 
checkpoint mediator protein Rad9 represents an additional barrier 
for resection. Rad9 is recruited to histone H2A phosphorylated at 
Ser 129 (also called y-H2A) and histone H3 methylated at Lys 79 by 
Dotl (refs 9, 10), and it inhibits resection both at DSBs and at 
telomeres*”®, a phenomenon conserved in evolution”””*. We examined 
the possible role of nucleosome remodellers in resection within 
nucleosomes associated with Rad9. In rad9A cells extensive resection 
is markedly increased, with an average rate of 10kbh * compared 
with 4kbh in wild-type cells’, and in hta1/2-S129* or dot1A mutant 
cells resection is moderately increased (6-7 kb h'') (Fig. 4B). 
Resection in fun30A rad9A cells was much faster than in fun30A cells 
and, similarly, Fun30 becomes partly dispensable for resection in the 
absence of y-H2A or Dotl (Fig. 4B and Supplementary Fig. 10). 
Accordingly, deletion of DOT1 partly suppressed the DNA damage 
sensitivity of fun30A mutants, indicating that the observed sensitivity 
of fun30A cells is at least partly due to a resection defect 
(Supplementary Fig. 10). Faster resection in fun30A rad9A cells was 
not due to increased INO80 or RSC activity within Rad9-free chro- 
matin, because resection kinetics in the fun30A rad9A rsc2A and 
fun30A rad9A arp8A triple mutants was still increased (Fig. 4B) 
relative to fun30A cells. These data suggest that Fun30 is particularly 
important for remodelling and resection within Rad9-bound 
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Figure 4 | Fun30 chromatin-remodelling factor promotes resection within 
Rad9-bound nucleosomes. A, Analysis of DSB end resection (a) and western 
blot showing Rad53 phosphorylation (b). STH1 expression (component of 

RSC) was eliminated by adding doxycycline. B, Analysis of resection at 28 kb 
from a DSB. Southern blots are shown in Supplementary Fig. 10. Error bars, s.d. 


chromatin. One possibility is that Fun30 helps to overcome the resec- 
tion barrier formed by Rad9. Accordingly, more Rad9 was accumu- 
lated at DSB ends in fun30A cells than in wild-type cells (Fig. 4C and 
model in Fig. 4D). 

We have identified Fun30 as a novel regulator of homologous 
recombination that promotes resection in the context of chromatin. 
Our findings in yeast shed light on the coordinated processes of DNA 
repair and chromatin remodelling and should provide insights into 
possible roles of the human homologue, SMARCADI (ref. 29), in 
DNA repair. 


METHODS SUMMARY 


All experiments were performed on the Saccharomyces cerevisiae strains listed in 
Supplementary Table 3. The resection kinetics were analysed by Southern blotting 
using probes specific to broken chromosome. Protein recruitment was followed 
by chromatin immunoprecipitation, and protein interactions were studied by 
co-immunoprecipitation and western blotting. Detailed methods are presented 
in Supplementary Information. 


(n = 3). C, ChIP analysis of Rad9-HA binding. **P < 0.01. D, Model of 
coupled resection and chromatin remodelling. a, Nucleases and helicases resect 
DNA within a nucleosome-free region next to a break. Histones or histone- 
bound Rad9 impedes further resection. b, Fun30 is recruited to nucleosomes 
being resected and increases the access to nucleosomal DNA. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Yeast strains and plasmids. All strains used are listed in Supplementary Table 3. 
The FUN30 ATPase dead mutants, helicase-domain deletion mutants and Cue- 
domain deletion mutants were first constructed using fusion PCR with a high- 
fidelity Phusion DNA polymerase (NEB). In each case, the mutation or deletion 
was introduced via a pair of overlapping PCR primers. The PCR products were 
subsequently cloned onto pXP735, a derivative of the plasmid pFA6a-kanMX6 
(ref. 30) with a TRP1 promoter inserted between the pTEF and kanMX coding 
sequences. Wild-type FUN30 was similarly cloned into pXP735. In each case, the 
FUN30 gene, including 490 bp of the 5’ untranslated region (5’ UTR), the coding 
sequence and 300 bp of the 3’ UTR was inserted between the pTEF and TRP1 
promoter sequences on the vector and verified with DNA sequencing. The res- 
ultant plasmid was then used as the template to amplify an integration cassette 
containing the 5’ UTR, the coding region, the 3’ UTR of FUN30, the TRP1 
promoter that directs expression of the kanMX gene and a tTEF fragment. The 
PCR products were transformed into a fun30A::natMX mutant. The 5' UTR of 
FUN30 and the tTEF sequences at both ends of the PCR products direct their 
integration into the genome by replacing the existing fun30A::natMX cassette via 
homologous recombination. As a result, the wild-type or mutant alleles were 
integrated at the endogenous FUN30 locus, with its expression controlled by the 
endogenous regulatory elements. Integrants were selected for G418 resistance and 
Clonat sensitivity, PCR amplified and verified with DNA sequencing. The 
sequences of all oligonucleotide primers used for constructing and sequencing 
these alleles are available upon request. 

Genome-wide screens for mutants that regulate gene integration efficiency. 
Both isu14::URA3 and sod24::URA3 gene knockout cassettes containing ~1.5 kb 
of flanking sequence on each side of the endogenous ISU1 or SOD2 locus were 
separately transformed into a pool of 4,836 homozygous diploid deletion mutants. 
About 8 X 10° independent Ura* transformants were obtained for each cassette. 
As a control, the centromeric circular plasmid pRS416 (URA3), which is main- 
tained in yeast cells ectopically, was also transformed into the pool. For each 
transformation reaction, all Ura’ transformants were pooled and a genomic 
DNA sample was extracted. An integration sample and the control sample were 
analysed with the barcode microarray to identify deletion mutants with altered 
integration frequency. Results of both the isu14::URA3 and sod24::URA3 screens 
were combined to compile a list of mutants with reduced or increased integration 
efficiency (by twofold or more). These mutants were then individually validated 
for relative integration efficiency using a third integration cassette, can14::URA3, 
in a well-controlled manner. Briefly, both the can14::URA3 cassette and a LEU2 
plasmid (pRS415) were transformed into each strain within the same transforma- 
tion reaction. Cells from such a reaction were split and plated on both SC-Ura 
plates to select for transformants resulting from integration and SC-Leu plates to 
select for cells containing the plasmid. The numbers of Ura* and Leu* transfor- 
mants obtained from each reaction were compared to calculate the relative integ- 
ration rate for each strain, with that of a wild-type strain arbitrarily set to 1.0 as a 
reference. Four such independent transformation reactions were carried out with 
each strain and the results were averaged. 

Microarray analysis of gene expression. Both the wild-type BY4741 and an 
isogenic fun30A mutant were grown in 10 ml of synthetic complete medium until 
log phase. Cells were collected and resuspended in 400 jl of TES solution (10 mM 
Tris-HCl, pH 7.5, 10 mM EDTA, 0.5% SDS). Cells were lysed by vortexing with 
glass beads after adding 400 ul of acidic phenol to the cell suspension. Cell lysates 
were clarified by centrifugation at 16,000g for 10min at 4°C. Protein in the 
supernatant was extracted once with an equal volume of phenol:chloroform 
(1:1) solution, followed by extraction once with an equal volume of chloroform. 
The resulting supernatant was mixed with three volumes of 100% ethanol and 0.1 
volume of 3M NaOAc (pH 5.2) and incubated at —80 °C for 2h. Total RNA was 
precipitated by centrifugation at 16,000g for 20 min at 4 °C and this was followed 
by washing once with 70% ethanol. After a brief air dry (5 min), RNA was dis- 
solved in 50 pl of DEPC-treated water and analysed with an Affimatrix micro- 
array. Two independent experiments were carried out for each strain and the 
results were combined. 

Chromatin immunoprecipitation. ChIP assays were carried out as previously 
described*'*’, with modification. Briefly, cultures were grown to a density of about 
1 X 10’ cells ml’ in pre-induction medium (YEP-raffinose) and HO was induced 
by adding 2% galactose. Protein-DNA complexes were crosslinked with 1% (final 
concentration) formaldehyde and incubated for 10min at room temperature 
(23 °C) with rotation. The reaction was quenched by adding glycine to a final 
concentration of 125mM and incubated at room temperature for 5 min with 
rotation. Cells were lysed with glass beads in lysis buffer (50mM HEPES, 
pH7.5, 1mM EDTA, 140 mM NaCl, 1% Triton X-100, 0.1% NaDOC, 1 mg ml! 
Bacitracin, 1mM Benzamidine and 1mM PMSF) supplemented with Roche 
protease inhibitor tablets. Cell extracts were sonicated to shear the DNA to an 


LETTER 


average size of 0.5kb. Cell debris was removed by centrifugation at 16,000g for 
10 min. Supernatant was collected and incubated with affinity-purified anti-Myc 
(Sigma M4439), anti-FLAG (Sigma M2), anti-Rad51 (gift from Dr Patrick Sung) 
or anti-Rfa2 (gift from Dr Wolf-Dietrich Heyer) antibodies overnight at 4°C, 
followed by incubation with protein G agarose beads for 4h at 4°C. The pro- 
tein-bound beads were washed twice with lysis buffer (as above), twice with lysis 
buffer containing 500 mM NaCl, twice in wash buffer (10 mM Tris-HCl, pH 8.0, 
1mM EDTA, 0.25 M LiCl, 0.5% NP-40 substitute and 0.5% NaDOC) and twice in 
X1 TE. Protein-DNA complexes were eluted with elution buffer (10 mM Tris- 
HCl, pH 8.0, 10 mM EDTA, pH 8.0, and 1% SDS) and incubated at 65 °C overnight 
to reverse crosslinking. Samples were digested with proteinase K at 37 °C for 6h. 
DNA was purified by phenol extraction and ethanol precipitation. Purified DNA 
samples were analysed by real-time quantitative PCR (ABI 7900HT) with primers 
that specifically anneal to DNA sequences located at indicated distances from the 
DSB, using the following conditions: 95°C for 10 min; 40 cycles of 95 °C for 15s, 
60°C for 1 min and 72°C for 30s. Histone loss was measured according to the 
method described in ref. 24. 

Co-immunoprecipitation. To examine the potential interaction between Fun30 
and resection enzymes, we performed co-immunoprecipitation as described in ref. 
33, with modification. Yeast cells (~2 X 10” ml!) grown in YEPD medium were 
treated with 0.1% methyl methanesulfonate (MMS) for 90 min. Treated cells or 
untreated control cells were lysed on a bead beater in lysis buffer (100 mM HEPES, 
pH8.0, 20mM Mg(Ac),, 150mM NaCl, 10% glycerol (v/v), 0.4% Nonidet P-40, 
10mM EGTA, 0.1mM EDTA, plus protease and phosphatase inhibitors). The 
extracts were gently clarified by centrifugation at 1,800g for 10 min at 4°C. The 
supernatant for each sample was collected and split into two aliquots. One thou- 
sand units of Benzonase (Sigma) were added to one of the aliquots (~1.5 ml) to 
digest genomic DNA and RNA. The supernatant was then incubated with protein 
G agarose beads for 3 h at 4 °C to capture the proteins or protein-DNA complexes 
that bind nonspecifically to the beads. Beads were discarded after centrifugation at 
1,800g for 5 min. The supernatant was incubated with 10 jg of anti-Myc or anti- 
FLAG antibody at 4°C overnight with agitation. In parallel, a mock immunopre- 
cipitation was established for each set of co-immunoprecipitation reactions by 
incubating one aliquot of MMS-treated sample with an appropriate IgG. Protein G 
agarose beads were added to each sample and the mixtures were incubated for 
another 3h at 4°C. Then the beads were subjected to extensive washing at 4 °C 
with wash buffer (100 mM HEPES, pH 7.4, 20 mM Mg(Ac)., 150 mM NaCl, 10% 
glycerol (v/v), 0.5% Nonidet P-40, 10 mM EGTA, 0.1mM EDTA, plus protease 
and phosphatase inhibitors). Immunoprecipitated proteins were eluted by boiling 
beads in <2 SDS loading buffer for 5 min. 

Western blotting. Whole-cell extracts (WCE) were prepared as previously 
described*. Yeast cells grown overnight in rich medium (YEPD) or pre-induction 
medium (YEP-raffinose: 1% yeast extract, 2% peptone, 2% raffinose) either before 
or at different times following HO break induction were collected. WCEs were 
prepared using trichloroacetic acid. Pelleted cells from 5 ml of culture were washed 
once with water and resuspended in 10% trichloroacetic acid. The cells were lysed 
by vortexing with acid-washed glass beads and the protein lysates were pelleted by 
centrifugation at 20,000g for 15 min. The pellets were washed with ice-cold 80% 
acetone and proteins were dissolved in X2 SDS sample loading buffer by boiling 
for 5 min. Samples were centrifuged for 5 min at 13,200g in a microcentrifuge and 
the supernatant was retained as the protein extract. WCEs or immunoprecipitated 
samples were resolved on a 7.5% or 8.5% SDS-PAGE gel and transferred onto a 
polyvinylidene difluoride membrane (Immobilon-P, Millipore). To examine the 
expression levels of Sgs1-9 Myc, Exol-9XMyc, Mrel1-13xMyc or Dna2- 
9XMyc in wild-type and fun30 cells, the membrane was probed with mouse 
monoclonal anti-Myc antibody (M4439, Sigma). To monitor checkpoint activa- 
tion during DSB repair, the membrane was probed with anti-Rad53 antibody (gift 
from Dr Achille Pellicioli). Anti-mouse and anti-rabbit IgG HRP-conjugated 
secondary antibodies were obtained from Santa Cruz Biotechnology. Blots were 
developed using the ECL Plus Western Blotting System (GE Healthcare). 
Analysis of resection at DSB ends. Resection of DSB ends was analysed at an HO- 
induced DSB at the MAT locus on chromosome III using Southern blots as 
previously described’. Briefly, genomic DNA was isolated by glass bead disruption 
using a standard phenol extraction method. Purified DNA was digested with 
EcoRI and separated on a 0.8% agarose gel. Southern blotting and hybridization 
with radiolabelled DNA probes specific for the MAT locus was performed as 
described previously’. The band signal intensities were analysed with 
IMAGEQUANT TL software (Amersham Biosciences) and normalized to the 
TRAI probe. DSB end resection beyond each EcoRI site for each time point was 
estimated as the percentage of the band signal intensity corresponding to the 
EcoRI fragment of interest at 1 h after break induction. 

Single-strand annealing and ectopic recombination assays. Repair by single- 
strand annealing between two partial LEU2 gene repeats was followed by Southern 
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blot analysis using a leu2 sequence probe as previously described’’. Experiments 
were done in a rad51A background to prevent repair by break-induced replication. 
Repair by ectopic recombination between chromosome V and chromosome III was 
followed by Southern blot analysis using a MATa sequence probe. Crossing-over 
was measured at an 8-h time point as previously described". To test cell viability in 
response to a DSB repaired by single-strand annealing or by ectopic recombination, 
cells were grown in pre-induction medium (YEP-raffinose) overnight to an early 
log phase. Cells were then diluted to a concentration of ~1 X 10° cellsml' and 
equal amounts of cells were plated on YEPD and YEP-Gal plates. Plates were 
incubated at 30°C for 3d. Viability was calculated by dividing the number of 
colonies grown on YEP-Gal by the number of colonies grown on YEPD multiplied 
by 100%. Three independent experiments were performed for each strain. 

Spot assays for analysing sensitivity to DNA-damaging agents. Yeast cells were 
grown in YEPD rich medium overnight to saturation. Undiluted and 1/10 serial 
dilutions of each cell culture were spotted onto plates containing camptothecin or 
phleomycin DNA-damaging agent at the indicated concentrations. Plates were 
incubated at 30 °C for 2-3 d before being photographed. 

Micrococcal nuclease digestion. Yeast cells from 80ml of culture 
(~2 x 10’ ml“) were collected by centrifugation at 4,000g and washed twice 
with distilled de-ionized H,O. The cell pellet was resuspended in 900 ul of 
sorbitol solution (1M Sorbitol, 50 mM Tris-Cl, pH 7.5). Then 0.56 wl of 14.3M 
B-mercaptoethanol and 100 pl of Zymolase 20T stock (dissolved in sorbitol solu- 
tion, 25mg ml ') were added to each sample. Samples were then incubated at 
30 °C with rotation for 25 min to digest cell wall. Spheroplasts were collected by 
centrifugation at 14,000g for 2 min and washed twice with ice-cold sorbitol solu- 
tion. The pellet was resuspended with 1.3 ml of lysis buffer (0.5 mM spermidine, 


1 mM f-mercaptoethanol, 0.075% Nonidet P-40, 50 mM NaCl, 10 mM Tris-HCl, 
pH8.0,5 mM MgCl, 5 mM CaChy, plus protease inhibitors). Micrococcal nuclease 
(Roche Applied Science) was added to each sample to a final concentration of 
100U ml‘. After mixing, 200 ull of nuclei suspension was immediately taken out 
as an undigested control. The remaining samples were subjected to MNase diges- 
tion at 37 °C. Aliquots of nuclei suspension were taken out at 2, 4, 8 and 16 min, 
and the reactions were immediately stopped by addition of 20 mM EDTA, pH 8.0, 
and 1% SDS. The supernatants were collected by centrifugation at 14,000g for 
10 min, followed by Proteinase K digestion (0.1 mgml') at 50°C for 3h. The 
resultants were purified by phenol-chloroform extraction, and DNA was recoy- 
ered by ethanol precipitation. DNA pellets were washed once with 70% ethanol, 
and then dissolved in X1 TE. After RNase treatment, samples were separated on a 
1.4% agarose gel. Nucleosome mapping at specific loci was examined by Southern 
blotting and hybridization with DNA probes specific for sequences located at the 
HO cleavage site or 5, 10 or 27 kb from the DSB. 
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The yeast Fun30 and human SMARCADI chromatin 
remodellers promote DNA end resection 
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Several homology-dependent pathways can repair potentially 
lethal DNA double-strand breaks (DSBs). The first step common 
to all homologous recombination reactions is the 5’-3’ degrada- 
tion of DSB ends that yields the 3’ single-stranded DNA required 
for the loading of checkpoint and recombination proteins. In yeast, 
the Mrel1-Rad50-Xrs2 complex (Xrs2 is known as NBN or NBS1 
in humans) and Sae2 (known as RBBP8 or CTIP in humans) 
initiate end resection, whereas long-range resection depends on 
the exonuclease Exol, or the helicase-topoisomerase complex 
Sgs1-Top3-Rmil together with the endonuclease Dna2 (refs 1-6). 
DSBs occur in the context of chromatin, but how the resection 
machinery navigates through nucleosomal DNA is a process that is 
not well understood’. Here we show that the yeast Saccharomyces 
cerevisiae Fun30 protein and its human counterpart SMARCAD1 
(ref. 8), two poorly characterized ATP-dependent chromatin remo- 
dellers of the Snf2 ATPase family, are directly involved in the DSB 
response. Fun30 physically associates with DSB ends and directly 
promotes both Exol- and Sgsl-dependent end resection through 
a mechanism involving its ATPase activity. The function of Fun30 
in resection facilitates the repair of camptothecin-induced DNA 
lesions, although it becomes dispensable when Exo1 is ectopically 
overexpressed. Interestingly, SMARCAD1 is also recruited to 
DSBs, and the kinetics of recruitment is similar to that of EXO1. 
The loss of SMARCAD1 impairs end resection and recombinational 
DNA repair, and renders cells hypersensitive to DNA damage result- 
ing from camptothecin or poly(ADP-ribose) polymerase inhibitor 
treatments. These findings unveil an evolutionarily conserved 
role for the Fun30 and SMARCAD1 chromatin remodellers in 
controlling end resection, homologous recombination and genome 
stability in the context of chromatin. 

Fun30 possesses intrinsic ATP-dependent chromatin-remodelling 
activity*, which is required to promote gene silencing in hetero- 
chromatin’. FUN30 deletion renders cells hypersensitive to camptothecin 
(CPT), whereas overexpression results in genomic instability’®. A role 
for Fun30 in the DSB response, however, remains enigmatic. While 
performing a genomic screen using a plasmid-based assay, we discovered 
that the fun30A mutant shows an increased efficiency in one-ended 
homologous recombination or break-induced replication (BIR) (Fig. 1, 
Supplementary Fig. 1 and Supplementary Table 1). We also found that 
gap repair, which is a two-ended homologous recombination reaction, 
is increased in the fun30A mutant (Supplementary Fig. 2). This shows 
that Fun30 affects a step common to all homologous recombination 
reactions. Interestingly, the fun30A mutant shares this phenotype with 
the resection mutants sgs1A and exol A’”, in which impaired resection 
slows down the degradation of transformed plasmids, favouring 
plasmid-based recombination’ (Fig. 1 and Supplementary Fig. 2). 
Altogether, this indicates that Fun30 promotes DNA end processing. 


Wouter W. Wiegant', Agnés Thierry°, Sandeep Burma‘, Haico van Attikum! & Bertrand Llorente”” 


To test whether Fun30 contributes to 5’-3’ DNA end resection, we 
analysed single-stranded (ss) DNA formation at an HO endonuclease- 
induced DSB at the MAT locus’*. Because ssDNA is resistant to 
cleavage by restriction enzymes, 5'-3’ resection at the DSB generates 
a ladder of ssDNA bands after restriction digestion of the genomic 
DNA and electrophoresis under alkaline conditions. In the absence 
of Fun30, the shortest ssDNA intermediate is formed with normal 
kinetics, but the formation of longer ssDNA intermediates is either 
delayed or abolished (Fig. 2a and Supplementary Fig. 3a). Chromatin 
immunoprecipitation (ChIP) of the ssDNA-binding replication 
protein A (RPA) complex at the HO-induced DSB confirmed these 
results (Supplementary Fig. 3c, d). Notably, we detected a similar 
resection defect at an I-Scel endonuclease cut site inserted at the 
HIS3 locus (Fig. 2c), ruling out a locus-specific effect. Our results 
indicate that Fun30 facilitates long-range end resection, which is 
further supported by a delay in the kinetics of DSB repair by single- 
strand annealing (SSA) in the fun30A mutant (Supplementary Fig. 4). 

In the combined absence of Fun30 and either Sgs1 or Exol, the 
resection defect was stronger than the defects in the corresponding 
single mutants (Fig. 2b and Supplementary Fig. 3b), leading to a more 
pronounced defect in RPA loading at the HO-induced DSB 
(Supplementary Fig. 3c). This correlated with higher plasmid-based 
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Figure 1 | fun30A and DNA end-resection mutants show high BIR 
efficiencies. BIR efficiencies of selected homozygous diploid null mutants 
relative to wild type (WT; BY4743). Mutants have been ranked according to 
their BIR efficiencies. Two BIR experiments using transformations of mutant 
pools were performed (Supplementary Fig. 1). The rank of each mutant in these 
two BIR experiments is given in parentheses. This rank is bottom-up for 
mutants with BIR efficiencies lower than wild type, and top-down otherwise. A 
schematic of the BIR assay is provided in the box. Error bars denote + mean 
absolute deviation of two independent experiments. 


Department of Toxicogenetics, Leiden University Medical Center, Einthovenweg 20, 2333 ZC Leiden, The Netherlands. @Centre National de la Recherche Scientifique, Unité Mixte de Recherche 7258, 
Centre de Recherche en Cancérologie de Marseille, Marseille F-13009, France. 3Aix-Marseille University, Unité Mixte de Recherche 7258, Marseille F-13284, France. ‘Division of Molecular Radiation 
Biology, Department of Radiation Oncology, University of Texas Southwestern Medical Center, 2201 Inwood Road, NC7.214E, Dallas, Texas 75390, USA. °CEA-DSV-Institut de Radiobiologie Cellulaire et 
Moléculaire, 92265 Fontenay aux Roses, France. “Institut Pasteur, Unité de Génétique Moléculaire des Levures, Centre National de la Recherche Scientifique and University Pierre and Marie Curie-Paris, 25 


rue du Docteur Roux, F75724 Paris Cedex 15, France. 
*These authors contributed equally to this work. 


27 SEPTEMBER 2012 | VOL 489 | NATURE | 581 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


sae2A fun30A sae2A sgs1A fun30A sgstA exotA fun30A exo1A 
rad51A rad51A rad51A rad51A rad51A rad51A rad51A rad51A 
oof oS So So o 
SSsetes “coesl 2s saan oS S888 o88888F o88gh8s% oS8SRKFF oSsskXBFX oSssqsg 
st r7 (10 kb) 4 4-17 
ar 6 (6.8 kb) 
a: 5 6.1 kb) - ——« Prey o- ee "a 
Tt | eee 1467 bb) S 
3 (5 kb) 
r2 
ee —e-  ——2 07K) “See5 «998 Jee -ee" hehe -7e 
Uncut 
1 
; ae. GBerve= Soegget 
—-) —— r1 (1.6 kb) - o-r" eee -? - 
Cut 
mm Cut 0.7 kb 
Probe = 
any wy a wr wa wy ww a 
MATa locus SHOSBBS S$ BS/BS MATa locus MATa locus MAT«. locus 
c d e 
WT fun30A fun30A fun30A 
pRS316 pRS316 pRS316-FUN30 pRS316-fun30(K603R) 
17 (12 kb) oo08Be oookBe oooRBS oookBe 7 
OCSSSreN CHOOSNKN CHGSrEN OBSSLEN 
15-6 \ ‘ 00.4 kb 
(8.5-8.6 kb) N : - = [O16 kb No CPT CPT (30 uM) 
r4 (6.9 nea, ae ~ wens E 5) 053 kb WT + pRs424 & yy ae 
13 (5.7 a, ocee ) cece oe@ = 4| 99.8 kb fun30A + pRS424 
r2 (4 kb) ‘. 4 25.3 kb WT + pExot 
1(2.4kb)— 7  5aee® "Geen -<ese 2 ‘ WT + pExo1(D173A) [iE 
2 
2 fun30A + pExo1 (Ae 
1 fun30A + pExo1(D173A) as at ans 
Uncut (1 kb) es ea amy os ot 
& 
7 we’ & yw’ x) yy’ .) yw’ 
s § ss 
Cut (0.4 kb) Qu. - =. ee « < < < 
0 1 2 4 


HIS3 locus HIS3 locus 


Figure 2 | Fun30 promotes long-range 5’—3’ DNA end resection and is 
recruited to DSBs. a, Southern blot analysis of Styl (S) and BstXI (B)-digested 
genomic DNA after alkaline gel electrophoresis. rl-r7 denote partially ssDNA 
fragments. b, As ina, except that exo1A mutants were MAT« strains, showing a 
longer uncut fragment (1.9 kb). c, Southern blot analysis of StyI-digested 


BIR efficiencies and stronger delays in the kinetics of SSA (Supplemen- 
tary Figs 2 and 4). These results demonstrate that Fun30 promotes 
both Sgsl- and Exol-dependent resection of DSBs. Interestingly, we 
observed smeared cut fragments in the SSA assay in the fun30A exo1A 
double mutant (Supplementary Fig. 4b). These indicate severely 
impaired long-range resection’, which suggests that the Sgs1 resection 
pathway depends more strongly on Fun30 than the Exol pathway. 
The ATPase activity of Fun30 is essential for its chromatin- 
remodelling activity*. Expression of wild-type Fun30, but not 
ATPase-deficient Fun30(K603R), restored end resection to wild-type 
levels in fun30A mutants (Fig. 2c). This indicates that chromatin 
remodelling driven by Fun30 facilitates long-range resection, either 
directly or indirectly. After induction of an HO endonuclease-induced 
DSB at the MAT locus, Fun30 accumulated at sites near the DSB within 
60 min and spread away at later time points (Fig. 2d), as previously 
observed for Sgs1, Dna2 and Exol (refs 2, 13). This supports a direct 
role for Fun30 in long-range resection, acting together with the Exol 
and Sgsl resection machineries. However, Fun30 could affect end 
resection indirectly by regulating gene transcription or by establishing 
an abnormal chromatin structure. The loss of Fun30 neither led to any 
major change in transcript accumulation of end-resection factors 
(Supplementary Fig. 5), nor affected nucleosome positioning at the 
HIS3 locus used to monitor resection (Supplementary Fig. 6). 
Together, these results implicate Fun30 in directly promoting long- 
range resection at DSBs. This is further supported by the fact that acute 
loss of Fun30 led to a long-range resection defect at the I-Scel break 
induced at the HIS3 locus (Supplementary Fig. 7). Interestingly, ChIP 
analysis of histone H3 and H2B occupancy around an HO-induced 
DSB at MAT showed that the loss of the histone ChIP signal is coupled 
to long-range resection in both wild-type and fun30A cells 
(Supplementary Figs 8 and 9)'*. This indicates that Fun30 does not 
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genomic DNA after alkaline gel electrophoresis to monitor ssDNA formation 
(r1-r7 fragments) at an I-Scel DSB generated at the HIS3 locus. d, Fun30-Myc 
levels at MAT before and after galactose-mediated HO induction measured by 
ChIP coupled to quantitative PCR. Error bars denote + s.e.m. of three 
independent experiments. e, Tenfold serial dilutions of yeast cultures. 


facilitate long-range resection by modulating histone occupancy, but 
rather by increasing access to DNA within DSB-associated chromatin’. 

We next investigated the physiological role of the resection function 
of Fun30. Gene conversion at a single HO-induced DSB at MAT is 
normal in a fun30A mutant, both in the presence and absence of Sgs1 
or Exol (data not shown). This shows that long-range resection is not 
essential for efficient gene conversion'’. We confirmed that the 
fun30A mutant is hypersensitive to the topoisomerase I inhibitor 
CPT, but not to the ribonucleotide reductase inhibitor hydroxyurea 
or ultraviolet light (Supplementary Fig. 10)’. Expression of wild-type, 
but not ATPase-deficient Fun30(K603R), restored CPT resistance in 
fun30A mutants (Supplementary Fig. 10a), indicating that resection 
driven by Fun30 ATPase activity protects cells against CPT- 
induced DNA damage. To show directly that the resection function 
of Fun30 is responsible for CPT resistance, we ectopically expressed 
Exol in a fun30A mutant. Expression of wild-type Exol, but not 
the Exol(D173A) nuclease-deficient mutant, suppressed both the 
resection defect and the CPT hypersensitivity of the fun30A mutant 
(Fig. 2e and Supplementary Fig. 11). This confirms that the resection 
function of Fun30 is required for the repair of CPT-induced DNA 
damage. Interestingly, the fun30A exolA and fun30A sgs1A mutants 
are more sensitive to CPT, but not to hydroxyurea, than are the 
fun30A, exo1A and sgs1A mutants (Supplementary Fig. 10b), which 
corroborates their stronger resection defects. However, the combined 
absence of Fun30 and Sae?2 led to a synergistic hypersensitivity to both 
CPT and hydroxyurea (Supplementary Fig. 10b), despite a resection 
defect that is comparable to that in the fun30A mutant (Fig. 2b), indi- 
cating that the roles of Fun30 and Sae2 in genome maintenance do not 
rely exclusively on facilitating resection”. 

Resection mutants are known to affect the type of yeast survivors 
that form by different recombination mechanisms in the absence of 
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functional telomerase’*’’. Under liquid culture conditions, cells lacking 
the Est2 subunit of telomerase accumulate mostly type II survivors. 
However, we detected almost equal proportions of type I and type II 
survivors in a fun30A est2A mutant, similar to what is observed in other 
resection-defective mutants (rad24A, rad17A (ref. 17) and exolA 
(ref. 16)) (Supplementary Fig. 12). Introduction of the cdc13-1 mutation 
that induces the formation of long ssDNA tracts at telomeres'® 
suppresses the fun30A est2A phenotype as it suppresses the phenotype 
ofa rad17A est2A mutant’’. Therefore, Fun30 affects recombination at 
unprotected telomeres, probably because of its role in resection. 
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Figure 3 | SMARCAD1 promotes end resection, homologous 
recombination and cell survival after genotoxic insults in U2OS cells. 

a, Immunodetection (top) and quantification (bottom right) of RPA foci 3h 
after 6 Gy of ionizing radiation. Scale bar, 10 um. Western blot analysis of 
SMARCAD1 in cells transfected with individual or pooled short interfering 
RNAs (siRNAs) (bottom left). Knockdown of EXO] serves as a control. Nuclei 
with more than ten RPA foci were scored. b, Western blot analysis of 
SMARCAD1 (left) and quantification of homologous recombination 
frequencies using a DR-GFP assay (right). c, Clonogenic survival of 
SMARCAD1 knockdown cells treated with CPT or the PARP inhibitor ABT- 
888. d, Immunofluorescence staining of SMARCAD1 and yH2AX at DSBs 
induced by mCherry—LacI-FokI (FokI WT) at a 256X Lac operator genomic 
array (top). Nuclease-deficient mCherry—LacI-FokI(D450A) was used as a 
control. Quantification of cells showing co-localization of SMARCAD1 and 
yH2AX at Fokl-induced DSBs (bottom). Scale bar, 5 um. e, Quantification of 
GFP-SMARCAD1, GFP-EXO1 and GFP-RPA accumulation at sites of laser 
micro-irradiation in live cells. DAPI, 4',6-diamidino-2-phenylindole; siScr, 
scrambled siRNA. Error bars denote s.e.m. (a, b and d) and + s.e.m. (cand e) of 
three independent experiments for all plots in a, b, c and e. 


SMARCAD1 is the human Snf2 family member that has the 
highest sequence similarity with Fun30. SMARCAD1 may function 
in the DNA-damage response because it is phosphorylated at 
canonical (Ser/Thr-Gln) ATM/ATR phosphorylation sites, as well as 
at non-canonical sites, in response to genotoxic insults’’”°. We 
examined whether SMARCAD1 also promotes DNA end resection. 
SMARCADI1 knockdown reduced the accumulation of RPA into 
ionizing radiation-induced foci (IRIF) (Fig. 3a), as well as that of 
green fluorescent protein (GFP)-tagged RPA at laser micro- 
irradiation-induced DSBs in U2OS cells’! (Supplementary Fig. 13a). 
Accordingly, we found that SMARCAD1 knockdown reduced ssDNA 
formation as determined by directly staining ssDNA-associated 
5-bromo-2’-deoxyuridine (BrdU) IRIF (Supplementary Fig. 13b). 
These phenotypes are similar to those seen after knockdown of 
EXO1, a major resection enzyme in human cells”’, indicating that 
the absence of SMARCAD1 impairs resection. In accord with a 
resection defect, we found that the loss of SMARCAD1 also impaired 
recombinational DSB repair. SMARCAD1 knockdown cells (1) were 
defective in the repair of an I-Scel-induced DSB by gene conversion in 
the direct-repeat (DR)-GFP reporter” (Fig. 3b); (2) showed a major 
reduction in the repair of CPT-induced DSBs as monitored by the 
disappearance of TP53BP1 foci in S/G2 phase cells (Supplementary 
Fig. 13c); and (3) were hypersensitive to DNA damage resulting from 
CPT or poly(ADP-ribose) polymerase (PARP) inhibitor (ABT-888) 
treatments (Fig. 3c). In addition, SMARCAD1 co-localized with 
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Figure 4 | Model for Fun30 and SMARCADI control of end resection 
through DSB-associated nucleosomes. Fun30 and SMARCAD1 weaken 
histone-DNA interactions in nucleosomes flanking DSBs, which facilitates 
ssDNA production by the Exol- and Sgs1-Top3-Rmil (STR)-Dna2 resection 
machineries. In the absence of Fun30 and SMARCAD1, histone-DNA 
interactions limit the extent of resection, but plasmid-based overexpression of 
Exol (pExol) or EXO1 (pEXO1) bypasses this impediment in yeast and U2OS 
cells, respectively. 
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or 
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yH2AX at laser-induced DNA damage and at DNA breaks generated 
by the FokI nuclease (Fig. 3d and Supplementary Fig. 13d), demon- 
strating that SMARCAD1 is recruited to DSBs. Importantly, GFP- 
tagged SMARCADI was recruited to laser micro-irradiation-induced 
lesions before GFP-tagged RPA and with kinetics similar to that of 
GFP-tagged EXO1 (Fig. 3e)”', as expected for a factor that promotes 
resection. Finally, the defect in RPA IRIF formation in SMARCADI1- 
depleted cells could be partially rescued by overexpression of EXO1 
(Supplementary Fig. 13e), indicating that SMARCAD1, like Fun30, has 
a direct role in DNA end resection and recombinational DSB repair. 

Recent reports from budding yeast’, fission yeast” and human 
cells** have shown that the Fun30 or SMARCAD1 Snf2 family 
members have related roles in promoting heterochromatinization. 
We show that Fun30 and SMARCAD1 are new DNA-damage- 
response proteins that facilitate DNA end resection and DSB repair 
in chromatin (Fig. 4). Their precise modes of action and the extent of 
their functional conservation remain to be determined. 


METHODS SUMMARY 

The yeast strains used are derivatives of S288C, W303 and JKM179 (see 
Supplementary Table 2). Details of their construction are provided in Methods. 
The BIR genomic screen was adapted from ref. 25, except that pADW17 and pLS192 
were used''. Tag arrays were from C.-Y Ho (Samuel Lunenfeld Research Institute, 
Toronto, Canada). The gap-repair assay used pSB110 (ref. 26), which contains an 
autonomous replication sequence but no centromere. Detection of ssDNA inter- 
mediates, SSA assays and ChIP experiments were performed as in refs 1 and 27. 
Transfection of U2OS cells, quantification of RPA foci after y-irradiation, co- 
immunostaining for SMARCAD1 and yH2AX after laser micro-irradiation, and 
live-cell imaging of GFP-tagged proteins to laser-induced breaks were carried out as 
described*'**. SMARCAD1 localization studies at Fokl-induced DSBs and DR-GFP 
assays were performed as previously reported”*”’. Survival of U2OS cells after CPT 
or ABT-888 treatment was quantified by the standard colony-formation assay. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Yeast strains and plasmids. The yeast strains used are derivatives of S288C, W303 
and JKM179 and are listed in Supplementary Table 2. Details of the primers used for 
gene disruption and confirmation are available on request. FUN30 was disrupted by 
one-step gene replacement using the hphNT1, KanMX4 or 13MYC::KanMX4 
cassettes”. Using such a strategy, BLY187 and BLY188 were obtained from 
LSY1709-9D, BLY185 and BLY186 from LSY1709-4A, BLY189 from LSY1983- 
16B, BLY031 from BY4741, BLY033 from BY4742, BLK020 from W303, BLY137 
from EM111, yHA629 from JKM179 and yHA630 from JKM179. SAE2 was 
disrupted by one-step gene replacement using the HIS3MX6 cassette to generate 
BLY195 and BLY196 from LSY1709-9D, and to generate BLY 197 and BLY198 from 
BLY187. rad51A yeast strains used for CPT- and hydroxyurea-sensitivity assays in 
Supplementary Fig. 10b are of the W303 background and were obtained by crossing 
the corresponding rad51A mutants with the parental W303 reference strain. 

Yeast strains used for the analysis of survivors in the absence of telomerase were 
built as follows: MNS961 was crossed with BLK020, the resulting diploid was 
grown on non-selective medium to lose pSD196 and obtain BLKO18, and 
senescence was analysed on BLK018 ascospores. To perform these experiments 
in the presence of the cdc13-1 mutation, YE1553 was crossed with MNS961 and 
the resulting diploid was sporulated to obtain BLK029. YE1552 was crossed with 
BLK020 and the resulting diploid was sporulated to obtain BLK026. BLK029 was 
eventually crossed with BLK026 and grown on non-selective medium to lose 
pSD196 and to obtain BLK033. Senescence was analysed on BLK033 ascospores. 
cdc13-1 cells were grown at 25 °C, and the presence of the cdc13-1 mutation was 
followed by its thermosensitivity at 30 °C. Senescence was analysed as described in 
the Supplementary Fig. 12 legend. 

EM111 was built as follows: the 5'-ACATAATGAATTATACAT-3’ sequence 
upstream of the regulated TATA box of HIS3 from the JKM139 strain was replaced 
by the 5’-TAGGGATAACAGGGTAAT-3’ sequence containing the I-Scel cleavage 
site. This modified HIS3 locus was then used to replace the HIS3 locus from Lev488 
(ref. 31), a W303 derivative strain containing the lys2::PGAL-I-Scel cassette. 

A plasmid containing the wild-type Fun30-coding gene was generated by intro- 
ducing a 4.5 kb Spel/NotlI fragment containing FUN30 into pRS316. An ATPase- 
deficient version of the Fun30-coding gene, which contains a point mutation in the 
ATPase domain that replaces a lysine with an arginine at position 603 
(Fun30(K603R)), was introduced by site-directed mutagenesis and confirmed 
by sequencing. 

Exol- and Exol(D173A)-overexpressing plasmids are pSM502 and pSM638, 
respectively, described in ref. 32. 

Media, growth conditions and genetic methods. Media, growth conditions and 
genetic methods are as described previously in ref. 33. Galactose induction of HO or 
I-Scel was performed as follows: one colony was grown overnight at 30 °C in either 
yeast extract peptone dextrose (YPD) or synthetic complete (SC) media. The next 
day cells were diluted and grown for about 8h using the same media to mid-log 
phase. Cells were washed with water and grown overnight in YPLGg (1% yeast 
extract, 2% peptone, 2% lactic acid, 3% glycerol, 0.05% glucose) or SC media, as 
indicated, to mid-log phase. HO or I-Scel expression was induced by adding 2% 
galactose, and switched off by adding 2% glucose. Note that HO expression in strains 
from the W303 background was driven from the centromeric plasmid pGAL:HO™. 
BIR genomic screen. Details of the screen are given in the Supplementary Fig. 1 
legend. The pool of homozygous diploid null mutants was preferred instead of the 
pool of haploid null mutants because, in principle, the genetic quality of the diploid 
deletion strain is higher”. Two independent experiments were carried out, each 
one included one transformation with circular pADW17 and one transformation 
with pLS192 linearized with SnaBI (Supplementary Table 3). For each mutant and 
for each experiment, the median of the normalized hybridization values of the 
different uptag and downtag spots was calculated”’. The mean absolute deviation 
from the two experiments is given in Fig. 1. Transformation ratios corresponding 
to BIR efficiencies were derived by dividing the median hybridization signal from 
the linear mini-chromosome pLS192 by that of the circular mini-chromosome 
pADW17 (Supplementary Table 1). The BIR defect of each mutant was confirmed 
by one or two individual transformations, and at least four individual transforma- 
tions were performed for wild-type diploid, sgs14 and fun30A. 

Alkaline electrophoresis. ssDNA intermediates were analysed by alkaline gel 
electrophoresis as described", and the blots were hybridized with double-stranded 
probes obtained by PCR. The coordinates of the MAT probe are 201176-201570 
on chromosome 3, and the coordinates of the HIS3 probe are 722025-722227 on 
chromosome 15. 

ChIP. ChIP was performed as previously described”’. In brief, cells were grown 
overnight in YPAD, diluted in YPLGg and grown to mid-log phase. Glucose was 
added to a fraction of the cells to repress HO, whereas galactose was added to the 
remainder of the cells to induce HO. Cells were fixed using formaldehyde and 
collected at 0, 1, 2, 4 and 6 h after galactose addition (cells grown in the presence of 
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glucose were collected at the 2-h time point). Extracts were prepared and subjected 
to ChIP using the following antibodies: anti-Myc (9B11; Cell Signaling 
Technology), rabbit anti-RPA (Agrisera) and rabbit anti-histone H2B or H3 
(Abcam). An aliquot of each extract was not immunoprecipitated and served as 
input. Input and immunoprecipitated DNA were purified and analysed by quant- 
itative PCR. Absolute fold enrichment for RPA or Fun30-Myc at the HO DSB was 
calculated as follows: for each time point, the signal from a site near the HO DSB at 
the MAT locus was normalized to that from the non-cleaved SMC2 locus in ChIP 
and input DNA samples. For each time point and site, the normalized ChIP signals 
were normalized to the normalized input DNA signals, because end resection 
can reduce the available DNA template. Finally, relative-fold enrichment was 
calculated by dividing the absolute-fold enrichment from induced cells to that 
of uninduced cells. 

DNA end resection and loss of histone H2B and H3 ChIP signals were calcu- 
lated as follows: to measure end resection using input DNA from ChIP experi- 
ments, the input DNA value for each site near the DSB was normalized to that of 
the input DNA value from the non-cleaved SMC2 locus. The ratios obtained for 
each time point after HO induction were normalized to that in uninduced cells. 
Similarly, to determine the loss of H2B and H3 ChIP signal, the ChIP value for 
each site near the DSB was normalized to that of the ChIP value from the non- 
cleaved SMC2 locus. The ratios obtained for each time point after HO induction 
were normalized to that in uninduced cells. ChIP results are presented as the mean 
of at least two experiments + s.e.m. 

Micrococcal nuclease digestion of chromatin. Micrococcal nuclease digestion of 
chromatin was performed as described in ref. 36. After purification, the DNA was 
digested by HindIII to show nucleosome positioning by indirect end-labelling. The 
HIS3 proximal probe used for Southern blot hybridization is the same as for 
alkaline electrophoresis experiments. The coordinates of the DED1 proximal 
probe are 722441-722809 on chromosome 15. Coordinates of the NOC2 proximal 
probe are 727786-728097 on chromosome 15. 

Irradiation of U2OS cells, SMARCAD1 knockdown, immunofluorescence 
staining and live-cell imaging. U2OS cells were irradiated with y-rays from a 
caesium source (J.L. Shepherd & Associates) or were micro-irradiated with a 
pulsed nitrogen laser (Spectra-Physics; 365 nm, 10 Hz) with output set at 75% of 
the maximum, as described in ref. 28. SMARCAD1 was depleted using siRNA1 
5'-AGGAUGCAUCUUGUCUGAAUUGAAA-3’, siRNA2 5’-GGGACGAUUG 
AAGAAUCCAUGCUAA-3’ and siRNA3 5’-GAGAUGUAGUUAUAAGGCU 
UAUGAA-3’. EXO1 was depleted using siRNA1 5'-UGCCUUUGCUAAUCC 
AAUCCCACGC-3’, siRNA2 5’-UAGUGUUUCAGGAUCAACAUCAUCU-3' 
and siRNA3 5'-UUUGUUAGUAGGUCCAUUUACCAGG-3’. The efficiency 
of knockdown for every experiment was verified by western blotting as described 
in ref. 28; cells transfected with scrambled siRNA (Invitrogen) served as controls. 
Immunofluorescence staining for YH2AX and SMARCAD1 accumulation after 
laser irradiation or Fokl expression, or for RPA or TP53BP1 foci after y-irradiation 
was carried out as described previously in ref. 28. For demarcating cells in 
S/G2 phase, nuclei were co-immunostained with anti-cyclin A antibody”. 
Quantification of foci was done as described in ref. 22. The following antibodies 
were used for immunofluorescence staining or western blotting: RPA (Abcam), 
SMARCAD1 (Bethyl Laboratories), EXO1 (Thermo Fisher), cyclin A, TP53BP1 
(Santa Cruz Biotechnology), actin (Sigma) and yH2AX (Upstate Biotechnology). 
For live-cell imaging combined with laser micro-irradiation, U2OS cells were 
transfected with GFP-RPA*, GFP-EXO1 (ref. 21) or GFP-SMARCAD1 
(OriGene), laser micro-irradiated and time-lapse imaged, and fluorescence 
intensities of micro-irradiated areas relative to non-irradiated areas calculated as 
described previously in ref. 21. To stain for BrdU and ssDNA foci, U2OS cells were 
grown in the presence of 101M BrdU (Sigma) for 16h, irradiated with 10 Gy of 
y-rays, fixed at the indicated times and immunofluorescence-stained with an anti- 
BrdU antibody (BD Biosciences) under non-denaturing conditions to detect BrdU 
incorporated into ssDNA, as described in ref. 21. For ectopic expression of EXO1 
in U2OS cells, the gene coding for the b isoform of EXO1 was sequentially cloned 
into pLenti6.3/V5-DEST by BP and LR clonase reactions (Invitrogen) using 
pEGFP-C1-Exolb” as a template. The final vector, pLenti6.3/Exolb-V5/DEST, 
was confirmed by sequencing. The accumulation of SMARCAD1 at Fokl-induced 
DSBs was monitored in U2OS cells containing 256 Lac operator repeats integrated 
into the genomeas described in ref. 29. Vectors expressing wild-type mCherry-LacI- 
Fokl or nuclease-deficient mCherry—LacI-FokI(D450A) were transfected into these 
cells, and 24h later the cells were immunostained for SMARCAD1 and yH2AX. 
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Macromolecular assemblies involving membrane proteins (MPs) 
serve vital biological roles and are prime drug targets in a variety of 
diseases’. Large-scale affinity purification studies of soluble- 
protein complexes have been accomplished for diverse model 
organisms, but no global characterization of MP-complex 
membership has been described so far. Here we report a complete 
survey of 1,590 putative integral, peripheral and lipid-anchored 
MPs from Saccharomyces cerevisiae, which were affinity purified 
in the presence of non-denaturing detergents. The identities of 
the co-purifying proteins were determined by tandem mass 
spectrometry and subsequently used to derive a high-confidence 
physical interaction map encompassing 1,726 membrane protein- 
protein interactions and 501 putative heteromeric complexes 
associated with the various cellular membrane systems. Our 
analysis reveals unexpected physical associations underlying the 
membrane biology of eukaryotes and delineates the global 
topological landscape of the membrane interactome. 

Various experimental methods have been used to examine protein- 
protein interactions (PPIs) among MPs**, but integral and lipid- 
anchored MPs are still notably under-represented in public interaction 
databases”. In contrast to the success of tandem affinity purification 
(TAP) procedures for characterizing soluble-protein complexes in 
yeast’, native MP complexes are more difficult to purify owing to their 
hydrophobic nature’. We therefore developed alternative TAP extraction 
and affinity isolation procedures, using buffers containing one of three 
different mild, non-denaturing detergents optimized for MP solubiliza- 
tion. Based on pilot studies, we selected Triton X-100, DDM (n-dodecyl- 
B-b-maltopyranoside), and C12E8 (octaethylene glycol monododecyl 
ether), as these were most effective and extracted complementary sets 
of yeast MPs (Supplementary Fig. la-c and Supplementary Text). For 
proteome-wide analysis, we used yeast strains bearing MP fusions with 
a carboxy-terminal chromosomal tag expressed at endogenous levels 
from native promoters*. The tagged MPs and stably associated proteins 
were purified essentially as described previously”, except that all steps 
were carried out in the presence of one of the three detergents. 

Based on existing database annotations, transmembrane helix (TMH) 
predictions, subcellular localization, literature curation and other 
information sources (Supplementary Fig. 1d and Supplementary 
Table 1), we compiled a target list that encompassed 2,141 annotated 
or predicted MPs (Fig. 1a and Supplementary Table 2), of which 1,590 
were tagged and processed (Supplementary Fig. 2a). These included 
1,144 putative integral, 400 peripheral and 46 lipid-anchored MPs. 
Detection coverage ranged from approximately 81% for low abund- 
ance MPs (<10° molecules per cell) to approximately 94% for high- 
abundance proteins (>1 0* molecules per cell) (Supplementary Fig. 2b)*. 


We did not attempt to purify 551 open reading frames (ORFs) because 
they were not detectably expressed’ or the growth of the tagged strains 
was impaired (Supplementary Table 2), although a substantial fraction 
(~66%; 362 of 551) were subsequently detected as interacting ‘preys’ in 
successful bait purifications. 

The overall target recovery rates were comparable for each detergent 
(~50%), and we successfully purified 77% (1,228 out of 1,590) of the 
tagged MPs (Fig. 1b). In addition to bait abundance, success depended 
on the cellular compartment (Fig. 1c) and TMH number of each MP 
(Fig. 1d). The highest coverage was obtained for those MPs associated 
with the Golgi, endoplasmic reticulum or mitochondria with fewer 
than four TMH (~80%), whereas MPs associated with the bud neck 
or possessing =10 TMH had lower success (~70%). Nevertheless, we 
were successful in purifying baits harbouring characteristic membrane- 
associated Pfam domains such as the ABC transporter, SNARE (soluble 
NSF attachment protein receptor) and PX domains (>90% success; 
Supplementary Fig. 2c and Supplementary Table 3). 

To assign confidence scores, the observed physical associations, 
including non-MPs, were first ranked using the purification enrich- 
ment score’’, and then integrated with purification enrichment scores 
computed from published yeast TAP mass spectrometry (TAP-MS) 
surveys of soluble proteins®’ to describe the cytoplasmic interface of 
membrane systems better. To optimize accuracy and coverage (Sup- 
plementary Text), we built a high-confidence ‘integrated network’ with 
the same precision (that is, a reference benchmark true positive to false 
positive ratio of approximately 22:1) as that of a highly accurate yeast 
soluble-protein interaction map (Supplementary Fig. 2a)’’. 

The integrated network consists of 13,343 high-confidence associa- 
tions among 2,875 proteins (Supplementary Table 4), representing 
two-thirds of the yeast proteome detectable by mass spectrometry”’. 
Notably, 6,082 of these PPIs are directly supported by the purification 
data described here and, for 1,726 PPIs involving at least one of 905 
putative MPs (Supplementary Table 4), two-thirds (64%; 1,110 out of 
1,726) have not been reported previously (Fig. le and Supplementary 
Fig. 2d). For example, we identified putative binding partners for 20 
plasma membrane proteins with previously unreported interactions, 
including orthologues’ of 8 human therapeutic targets (Supplemen- 
tary Table 5). 

On average, the MPs have approximately half the number of 
interaction partners as yeast soluble proteins (geometric mean ~2.1 
versus ~4.7; Supplementary Fig. 2e, f), possibly owing in part to partial 
detergent-induced dissociation. As with soluble proteins’, essential, 
highly expressed and evolutionarily conserved MPs exhibit greater 
connectivity (Supplementary Fig. 2g, h), particularly components of 
the bud, cortex and Golgi (mean connectivity of =2.8). 
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Figure 1 | Proteome- wide purification of yeast MPs. a, Yeast membrane 

compartments. Numbers indicate MPs (listed in Supplementary Table 2) whose 
purification was attempted. b, Overlap of tagged bait proteins identified by mass 
spectrometry after purification in three different detergents (left panel), and the 
proportion of successful bait recovery (right panel). c, Fraction of tagged and 
successfully purified MPs according to simplified Gene Ontology consortium 
cellular-component annotations. d, Number of tagged and purified yeast MPs 
according to TMH number. e, Overlap of MPs (top panel) and their interactions 
(bottom panel) in the MP sub-network, compared to previous high-throughput 


Independent criteria support the reliability of the integrated and MP 
interaction networks, and the quality of the underpinning data. First, 
benchmarking against a reference set of PPIs derived from MPs in the 
CYC2008 catalogue of manually curated protein complexes’! showed 
that the accuracy of the MP interactions was comparable to, or higher 
than, published small-scale experiments’*"’, protein-fragment com- 
plementation assay (PCA)? and large-scale yeast PPI networks derived 
for soluble proteins in past TAP surveys®’ (Fig. 1f). Second, we 
observed significant enrichment (P < 0.05) for interactions between 
MPs in the same, or related, subcellular compartments (Supplemen- 
tary Fig. 3a and Supplementary Table 6). Third, the average semantic 
similarity of the Gene Ontology consortium annotations of the 
interacting proteins in the MP network (Supplementary Fig. 3b) was 
comparable to or higher than that obtained for several previous high- 
confidence PPI networks**'®, although slightly lower than that 
obtained for soluble protein TAP” or literature curation’*”’, probably 
reflecting the less complete annotations of MPs in general. Finally, we 
compared our integrated network with genetic interaction data 
derived for yeast membrane’*”? and non-membrane*”’ biological 
systems (Supplementary Text). We found significant enrichment 
(P < 10°) for highly correlated genetic interaction profiles between 
pairs of physically interacting proteins compared to random pairs 
(Supplementary Fig. 3c). Collectively, these results suggest that our 
high-confidence PPI are of similarly high quality as the most reliable 
soluble yeast interactome data sets published so far®”"®"”. 
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studies**'*"” and high-confidence (HC) literature PPI from the MIPS (Munich 
Information Center for Protein Sequences) and BioGRID (Biological General 
Repository for Interaction Datasets) databases'*">. f, Average true positive-false 
positive (TP:FP) ratio of this study (MP sub-network) versus an earlier TAP- 
MS-derived soluble yeast protein data set’’, literature curated interactions'*”’, 
and a PCA yeast interactome study’, as measured against a random reference 
membrane PPI set (CYC2008 complex catalogue)'’. Note that the actual TP:FP 
ratio depends on the definition of the reference set and is given here only to 
compare data sets. ER, endoplasmic reticulum. 


To deduce the membership of MP complexes, we used the Markov 
clustering algorithm” to partition the integrated network into densely 
connected subgroups of interacting proteins, allowing for component 
sharing between clusters’ (Supplementary Text). In total, we identified 
720 clusters representing putative multi-protein complexes (Supplemen- 
tary Table 7 and Supplementary Fig. 3d), of which most (501) contain 
at least one MP (Fig. 2 and Supplementary Table 7). Many (99) of the 
264 predicted heterodimeric MP-containing complexes consisted of 
only uncharacterized proteins or one uncharacterized factor with an 
annotated MP (Supplementary Table 7), which most commonly con- 
tained Pfam domains related to the cell wall or intracellular signalling 
(Supplementary Table 8). 

Overlap between these clusters and the CY C2008 complex catalogue 
showed that of the curated yeast complexes containing an MP, 40% (67 
out of 167) had 90% or more subunits matching a cluster, whereas only 
approximately 7% (12 out of 167) of known MP complexes were 
missed by our study, presumably indicating limitations of our isolation 
procedures (Supplementary Fig. 3e). Conversely, 280 of our predicted 
MP clusters had limited overlap (<5% of the components) with 
CYC2008 complexes (Supplementary Fig. 3f), representing a rich 
resource for biological discovery. Unexpected connections were 
uncovered that suggest new roles even for well-characterized MPs, 
such as the association between Tor2 and Vps8 (Supplementary Fig. 4j) 
that may mediate TOR signalling at endosomes”. Likewise, our 
study provides additional support for proteasome engagement with 


©2012 Macmillan Publishers Limited. All rights reserved 


@ Retromer 
Vps35_ @Tbs1 


© Oligomeric Golgi 
complex 


@ Unannotated 


O Bloc! 
Cnit 


LETTER 


oa Cog8 
Vps17 ay Me YaRo08W @ @ Fsh3 
Muk1 @S'7» Vps26 Cogs ¢ fi Cogé 

Vps5 Cog3 Cog7 

N ‘ng 

Cog2) = Cog5 

@ AP1 adaptor 

Apl2 

» —@ Sas4 


<Zi\ ~~ Aps1 


Apl4 
# @ Aim14 


YFLOS4W QTD 9 nom 
Apm2 Irc6 


© Golgi to ER 


Ufe1 YKte 


Sec20 @¢ py Sec22 
Dsit « 1@ Gos! 
y 
Tip20 SS ~~ Use 


Sec39 


@ ER to Golgi 
Erv25 
ErpS AR Erp2 
| | 
Erp3 > Oy 


Emp24 @ 
Det1 


Erp1 


© COPI coatomer 
YLR422W_ Sec27 
Ribt red Glo3 
Sec28 
1@ Mips 


muss 2A 
Sec21 


© Unannotated 
Mps2 @ 
Ssf2 @—@® Skm1 


Figure 2 | Global organization of yeast MP complexes. Predicted MP 
clusters (subunits shown as similarly coloured nodes) inferred from the 
integrated network of high-confidence PPI (edges), demarcated according to 
primary compartment annotations. Representative complexes at the periphery 
highlight some of the findings of our study, including novel complexes and 


organelles through the membrane fusion factor SEC18 (Supplemen- 
tary Text and Supplementary Fig. 5a-c). 

We independently evaluated 21 interactions involving MP com- 
ponents by co-immunoprecipitation and/or iMYTH” (integrated 
membrane yeast two-hybrid) assays and confirmed >90% (19 out of 
21) (Fig. 2 and Supplementary Fig. 4a-k). For example, we validated 
the interaction of PIR (proteins with internal repeat) domain- 
containing envelope proteins Pir3 and Pir4 (Supplementary Fig. 4i), 
and the association of Sctl, an integral membrane acyltransferase 
involved in glycerolipid biosynthesis, and the uncharacterized integral 
MP YPRO91C, which localizes to the endoplasmic reticulum 
(Supplementary Fig. 4h). 

As complex subunits should exert a consistent biological role, we 
next used high-content fluorescence screening of mutant strains to 
systematically investigate the phenotypic consequences of deleting 
individual components of 26 different MP complexes (Supplemen- 
tary Fig. 6a and Supplementary Table 9). Based on the patterns of 
compartment-specific fluorescent markers, subunit loss typically 
resulted in a discernible and consistent phenotype for most (20 out 
of 26) of the predicted complexes tested (representative data shown in 
Supplementary Fig. 6; see Supplementary Table 9 and Supplementary 
Text). 

The association of 321 functionally uncharacterized proteins with 
putative MP complexes provides insight into their possible cellular 
roles (Supplementary Table 10), demonstrated by the identification 
of Irc6 as a binding partner of the heterotetrameric AP1 clathrin 
adaptor complex (Fig. 3a). AP1 is peripherally associated with Golgi 
and endosomal membranes, where it incorporates cargo proteins like 
Sna2 and chitin synthase Chs3 into clathrin-coated vesicles**. As with 
loss of the AP1 subunit Apm1, we found that loss of Irc6 increased 
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cell-surface missorting of Chs3 and Sna2 (Fig. 3b-e). Irc6 is homolog- 
ous to mammalian p34, which associates with soluble AP1 adaptors in 
human cells but has not been characterized in detail”*. Consistent with 
this, an Irc6-GFP (green fluorescent protein) fusion localizes to the 
cytosol (data not shown) and shows sub-stoichiometric binding to 
AP1 by co-immunoprecipitation (Fig. 3f). Although many AP1 
regulators recognize the y-adaptin (Apl4) appendage domain, yeast 
two-hybrid assays indicated that Irc6 interacts with the core domain of 
Apl4 and with a subunit of the AP2 complex (Fig. 3g), suggesting a 
wider role in regulating clathrin adaptors. Interestingly, the amino- 
terminal conserved domain of Irc6 (Fig. 3h), which is necessary and 
sufficient for the interaction with AP1 (Fig. 3i), has a predicted fold 
similar to that of regulatory Rab-like GTPases”®. These data suggest 
Irc6 is a conserved, functionally important AP1 interacting protein. 
A second example is the interaction of the uncharacterized protein 
Ssp120 with the annotated MPs Emp46 and Emp47, which facilitate 
secretion by sorting cargo proteins into endoplasmic-reticulum- 
derived COPII-coated vesicles”’ (Fig. 4a). Ssp120 lacks a transmem- 
brane domain but contains a signal sequence mediating endoplasmic 
reticulum translocation”® and, like Emp47, localizes to early Golgi in 
wild-type yeast®. Ssp120 is missorted to the vacuole in emp47 (but not 
emp46) mutants (Fig. 4b) and is secreted after additional deletion of 
Vps10, a receptor that directs vacuolar transport at the late Golgi 
(Fig. 4c). Ssp120 has a domain structure similar to human MCFD2 
(Fig. 4d), which binds the mammalian Emp46 and Emp47 homologue 
LMANI and is a cargo-specific adaptor for secretion of the blood 
coagulation factors V and VIII. Mutations in LMANI or MCFD2 cause 
the bleeding disorder F5F8D, or combined factor V and factor VIII 
deficiency”. Intriguingly, the C-terminal region but not the EF-hand 
motifs of Ssp120 (Fig. 4d) are important both for Golgi localization 
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Figure 3| Functional association of Irc6 with AP1. a, Model illustrating AP1 
dependent incorporation of Chs3 and Sna2 into clathrin-coated vesicles at 
Golgi and endosomal compartments. Inset indicates subunit associations of the 
AP1 complex. MVB, multivesicular body. b, Fluorescence microscopy of Chs3- 
GFP. Mutation of AP1 (Apm1) or Irc6 restores Chs3-GFP cell-surface delivery 
in chs6 mutants (see Methods). c, Loss of AP1 or Irc6 in chs6 mutants restores 
Chs3-dependent chitin ring formation. d, Quantification of chitin production. 
The graph shows the average colony fluorescence (arbitrary units, a.u.) + s.d. of 
at least six replicates. e, A Sna2(Y75A)-GFP mutant lacking an AP3 sorting 
motif mislocalizes to the cell surface in strains lacking AP1 (Apl2) or Irc6, 


(Fig. 4e) and interaction with Emp47 (Fig. 4f). Consistent with a shared 
role, both ssp120 and emp47 single mutants show calcium-sensitive 
growth, whereas ssp120 emp47 double mutants do not display 
increased sensitivity (Fig. 4g)°’. Consistent with a conserved role for 
Ssp120 as a cargo-specific adaptor, Emp47 retained its ability to escort 
Emp46 from the endoplasmic reticulum in ssp120 mutants (Fig. 4h). 
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similar to a Sna2(Y65A/Y75A)—GFP mutant lacking both AP1 and AP3 motifs. 
f, Immunoblot analyses of epitope-tagged proteins in whole-cell lysates (WCL) 
and anti-haemagglutinin (HA) immunoprecipitates (IP). Molecular masses are 
indicated on the right. g, Two-hybrid assay of the indicated Gal4-activation 
domain (AD) and DNA-binding domain (BD) fusion proteins; cells were 
grown on medium containing histidine (left panel) or tested for activation of a 
HIS3 reporter on medium lacking histidine (right panel). h, Sequence 
conservation between Irc6 and human p34. i, Two-hybrid analysis of 
interactions between AP1 subunit Apl/4 and truncated forms of Irc6. aa, amino 
acid; WT, wild type. 


Further elucidation of this and other novel predicted interactions 
should yield mechanistic insights into membrane system function, 
many of which are likely to be conserved. 

To investigate the broader evolutionary conservation of MP 
complexes, we mapped homology relationships across 71 sequenced 
eukaryotic genomes using InParanoid and Compara (Supplementary 
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Figure 4| Ssp120 participates in Golgi to ER recycling. a, Model showing 
recycling of the Emp46-Emp47-Ssp120 complex between endoplasmic 
reticulum and early Golgi. b, Fluorescence microscopy of GFP-tagged Ssp120 
in wild-type and mutant strains. DIC, differential interference contrast. 

c, Immunoblot showing secretion of tagged Ssp120 constructs from indicated 
strains. d, Domain organization of human MCFD2 and yeast Ssp120 
constructs. Signal sequences (SS) and EF-hand domains highlighted; green 
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boxes indicate additional homology, dotted lines mark deleted regions. 

e, Localization of Ssp120-GFP constructs in wild-type cells and emp47 
mutants. f, Immunoblot analyses of epitope-tagged proteins in whole cell 
lysates (WCL) and anti-HA immunoprecipitates (IP). g, Strain growth on YPD 
media containing 0.5 M CaCl). h, Fluorescence microscopy of GFP-tagged 
Emp46 in wild-type and mutant strains. Scale bars, 2 jim. 


©2012 Macmillan Publishers Limited. All rights reserved 


Table 11). Among our 501 MP-containing complexes, one-third (132) 
were present among >90% of organisms considered, whereas <10% 
(48) were restricted to fungi (Supplementary Table 12). Most MP 
complexes had at least half of their subunits conserved in worm 
(363), fly (374) and/or human (389) (Supplementary Fig. 7a), including 
90 complexes whose components were fully conserved (Supplemen- 
tary Fig. 7b). 

The MP interactions reported here reveal the global modular 
architecture of the membrane systems of a model eukaryote. A key 
element was the parallel purification of endogenous MP complexes in 
the presence of three different detergents. Although TAP-MS offers 
exceptional coverage and accuracy, our approach has potential caveats. 
The affinity tag may interfere with protein localization or interactions, 
transient associations may be lost and the detergents may disrupt 
associations, potentially contributing to the smaller average size of 
MP complexes compared to non-membrane assemblies. The data 
presented here may also be impacted by missing, incomplete or con- 
flicting information on functional annotations, genome sequences and 
subcellular localization information (Supplementary Fig. 1d), as well 
as the occurrence of spurious interactions. Nevertheless, the rate of 
false positive PPIs in the integrated network, as estimated from the 
reference benchmark precision (true positive:false positive ratio of 
22:1), is just 4.3% (corresponding to 573 non-specific interactions), 
comparable to that for previous TAP studies of the soluble yeast 
interactome®”. 

Although limitations exist, the systematic elaboration of a high- 
confidence MP interactome provides many opportunities for functional 
inference. Our results identify associations between un-annotated yeast 
MPs and functionally cohesive complexes involved in diverse biological 
processes. The conservation of many of these complexes provides 
insights into the membrane biology of eukaryotes, including humans. 
Just as genetic studies in yeast have led to the characterization of con- 
served human disease pathways, the network of MP complexes reported 
here predicts orthologous relationships that may be relevant to human 
disorders. 

All of the experimental data, interactions and predicted complexes 
of this study are publicly accessible through a dedicated database 
(http://wodaklab.org/membrane/), complementing previous maps of 
the yeast soluble proteome as valuable community resources. 


METHODS SUMMARY 


MP target selection, strain construction, TAP-MS, derivation of the PPI network 
and protein complexes, conservation analysis, phenotypic assays and other experi- 
mental and bioinformatic procedures are detailed in Methods and Supplementary 
Information. Yeast strains used in this study are listed in Supplementary Table 13. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Yeast strains, media and plasmids. The yeast strains and plasmids used in this 
study are listed in Supplementary Table 13. For the large-scale purifications, the yeast 
strains were obtained from the Yeast-TAP-fusion library deposited in Open 
Biosystems’. Standard rich (YPD), synthetic medium plus dextrose lacking histidine, 
and synthetic complete medium containing 2% glucose as a carbon source were used 
for cell growth. Standard methods were used for the introduction of DNA into yeast”. 
Tandem affinity purification and mass spectrometry. Each tagged protein was 
purified from 4L yeast cultures grown in rich media (YPD) under native condi- 
tions and prepared for mass spectrometry essentially as described previously’, 
except with some modifications of our standard procedures. With the addition 
of non-ionic detergents to our buffers, we were able to solubilize and purify the 
majority of the yeast MPs. For most of the MPs, three purifications were carried 
out in parallel using three different non-ionic detergents, and two complementary 
mass-spectrometry techniques, MALDI-TOF (matrix assisted laser desorption/ 
ionization time-of-flight) mass spectrometry and tandem LC-MS (liquid chro- 
matography-electrospray ionization—-mass spectrometry), were used to detect 
physically interacting proteins. 

These detergents not only have different abilities to solubilize affinity-tagged 
proteins (Supplementary Text) but may differ in the extent to which they disrupt 
PPI. The detergents were removed from the purified protein samples before mass- 
spectrometric identification of co-purifying polypeptides using tandem LC-MS 
instruments that are more sensitive than those used previously to characterize 
soluble yeast protein complexes’. Gel images and confidence scores for protein 
identification by MALDI-TOF MS are made available in our database (http:// 
wodaklab.org/membrane/). Confidence scores for protein identification by 
MALDI-TOF and tandem LC-MS were calculated essentially as described previ- 
ously’. A tagged protein is considered successfully purified if either it or an 
annotated or putative MP with = 2 TMH is identified by mass spectrometry 
(probability score = 90% for tandem LC-MS; Z score = 1 for MALDI-TOF mass 
spectrometry). Details on the derivation of the PPI network and prediction of 
protein complexes are described in Supplementary Text. 

Detergents. The following eleven detergents were used at a final concentration of 
1% in the affinity purification of the initial test set of yeast MPs: Triton X-100, DDM 
(n-dodecyl-f-D-maltopyranoside), C12E8 (octaethylene glycol monododecyl 
ether), LDAO (lauryldimethylamine-oxide), CHAPS, (3-[(3-cholamidopropyl) 
dimethylammonio]-1-propanesulfonate), OG (octaethylene glycol monododecyl 
ether), DM _ (n-dodecyl-B-d-maltoside CHAPSO  (3-[(3-cholamidopropyl) 
dimethylammonio]-2-hydroxy-1-propanesulfonate), FC-12  (fos-choline-12), 
Nonidet P-40 (NP-40) and deoxycholate. 

Colony overlay. The immunoblotting assay for secretion of haemagglutinin- 
tagged proteins was carried out essentially as described previously’’ with antisera 
against haemagglutinin. 

Fluorescence microscopy. The plasmids encoding fluorescent markers for actin, 
spindle and mitochondria (Supplementary Table 13) were constructed essentially 
as described previously**. The endoplasmic reticulum fluorescent plasmid, pScs2- 
REP, was a gift from T. Levine. The Golgi Sec7-RFP fluorescent plasmid was 
constructed using a SEC7 gene from the MORF (moveable ORF) library” 
through the Gateway recombineering system. For the morphological study, each 
compartment-specific fluorescent plasmid was transformed into the Y7072 MATa 
can1A::STE2pr-spHIS5 lyp1A; his3A1 leu2A0 ura3A0 met15A0 LYS2+ strain 
background. The resulting query strain with the expressed fluorescent plasmid 
was then mated to an array of MAT mutant strains through synthetic genetic 
array (SGA) technology*. After the subsequent standard SGA selections’, the 
mutants containing compartment-specific markers were visualized using high- 
content confocal fluorescent microscopy. 

For vacuolar staining, cells were pulsed with 32 1M of endocytic dye FM4-64 
(Life Technologies) for 20 min in the dark at 30 °C, after which cells were resus- 
pended in YPD and incubated for 30 min at 30 °C. Cells were subsequently washed 
twice in phosphate buffered saline (PBS) before visualization. When required, 
images were captured using a spinning disc confocal system (WaveFX; 
Quorum) with an ultra-cooled 512 back-tinned electron-multiplying charge- 
coupled device camera, or a microscope (E-600FN; Nikon) with an Orcall camera 
(Hamamatsu). Alternatively, strains expressing GFP fusion proteins were viewed 
using an Axioplan 2 fluorescence microscope (Carl Zeiss). Images were captured 
with a CoolSNAP camera (Roper Scientific) using MetaMorph software 
(Molecular Devices) and adjusted using Adobe Photoshop (Adobe Systems). 
Immunoprecipitation. The endogenously non-overproduced TAP-tagged 
proteins expressed in BY4741 by targeted homologous recombination’ were con- 
firmed by western blot analysis using anti-TAP antibody that recognizes the 
protein A epitope of the TAP tag. TAP-tagged strains were transformed with 
plasmids expressing a galactose inducible overproduced haemagglutinin-tagged 
full-length protein. The transformants were selected on synthetic-defined media 


lacking uracil. These strains were grown at 30°C in synthetic-defined media 
lacking uracil with 2% sucrose as a carbon source. The cells were then sub-cultured 
and induced for 4 h at 30 °C in yeast peptone medium with 2% galactose. The cells 
were pelleted by centrifugation at 1,900g for 20 min and then resuspended in lysis 
buffer essentially as described previously’. The lysed yeast cells were incubated for 
3h in the presence of 1% Triton X-100 at 4°C with immunoglobulin G (IgG) 
Sepharose 6 Fast Flow beads. The immunoprecipitation procedure was then 
carried out essentially as described previously’, except that in each purification 
step we added 1% Triton X-100. Alternatively, for the immunoprecipitation 
experiments in Figs 3 and 4, cell extracts were prepared from spheroplasts 
resuspended in lysis buffers (50 mM Tris-Cl pH 8.0; 50mM NaCl; 0.1% Triton 
X-100; 1mM DTT (dithiothreitol) and 1/100 EDTA-free protease inhibitor 
(Thermo) (Fig. 3); as well as 50mM HEPES pH7.9, 150mM KCl, 1% Triton 
X-100, 1mM DTT and 1/100 EDTA-free protease inhibitor (Thermo) (Fig. 4)) 
and incubated with rabbit anti-HA.11 (Santa Cruz) and protein A Sepharose beads. 
Western blot analysis of immunoprecipitated proteins was carried out using mouse 
anti-GFP (Roche) and mouse anti-HA.11 monoclonal antibodies (Covance). 

In the case of co-immunoprecipitation with untagged proteins, the standard 

protocol was adapted with protein A resins (Pierce). The mouse Pre4 and rabbit 
Rptl polyclonal antibodies were obtained from Abcam, and the Sec18 rabbit 
polyclonal antibody was a gift from C. Ungermann. 
Evaluation of Chs3 and Sna2 sorting. Assays to measure the function of the AP1 
pathway were based on the localization of AP1 cargo proteins Sna2 and Chs3. 
Sna2-GFP is sorted to the vacuole by the binding of either AP1 or AP3 adaptors to 
distinct Sna2 motifs**. The Sna2(Y75A)-GFP mutant, which lacks the AP3 motif, 
depends on AP1 for its vacuolar targeting and is mis-sorted to the cell surface in 
strains lacking AP1, similar to a Sna2 mutant (Sna2(Y65A/Y75A)-GEP) lacking 
both AP1 and AP3 motifs, thus providing a visual indication of AP1 function. 

The chitin synthase Chs3, which requires Chsé6 for transport to the cell surface, is 
retained in the cell by the AP1-dependent recycling pathway in chs6 mutants”, 
preventing chitin ring formation. Disruption of the AP1 complex restores Chs3 cell 
surface delivery through a bypass pathway and restores chitin ring formation in chs6 
mutants. Thus, chitin levels in chs6 mutants provide a quantitative measure of AP1 
function. Chitin levels were quantified by measuring the fluorescence of colonies 
grown on YPD plates containing 50 jig ml’ calcofluor white at 30°C for 3 days. 
Fluorescent-light images were captured with a Fluor S Max Multilmager (Bio-Rad 
Laboratories) using the 530DF60 filter and Quantity One software (version 4.2.1; 
Bio-Rad Laboratories), and image densitometry was performed as described previ- 
ously. Alternatively, chitin rings were visualized by fluorescence microscopy of 
cells fixed with 3.7% formaldehyde, incubated in 100 pg ml”! calcofluor white in 
0.5 M Tris pH 9.6 for 30 min at 30 °C, and washed twice before analysis. 

Yeast two-hybrid analysis. All plasmids for yeast two-hybrid analysis were 
generated by homologous recombination in pGAD-C2 and pGBDUC-2, essentially 
as described previously*®. Haploid strains expressing GAL4-AD fusion proteins were 
mated with strains expressing GAL4-BD fusion proteins. Diploid cells were tested for 
activation of the HIS3 reporter by growth on selective media lacking histidine. 
iMYTH assay. SCT1 prey generation in the pPR3N vector, endogenous tagging of 
YPRO91C with the Cub-LexA-VP16 MYTH tag and subsequent iMYTH screening 
were carried out as previously described”. In brief, bait and artificial bait expres- 
sing variants of the Saccharomyces cerevisiae NMY51 or L40 MYTH reporter 
strains were generated and transformed with prey plasmid expressing either con- 
trol construct or NubG-SCT 1p construct. Control plasmids, expressing Nubl- 
tagged (’positive’) and NubG-tagged (‘negative’) forms of the unrelated, plasma 
membrane-localized Fur4 uracil permease, were obtained from Dualsystems 
Biotech (http://www.dsbiotech.ch/). Transformed cells were picked from solid 
media, diluted into 150 ul of sterile ddH2O and spotted directly onto selective 
media. Plates were grown at 30 °C for 3-6 days and growth was scored. 
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Bud Abbott and Lou Costello made disagreements about order in baseball the stuff of comedy legend. 


AUTHORSHIP 


Who’s on first? 


When scientists collaborate on an experiment and a paper, 
it can be hard to decide who gets the credit and how much. 


BY AMBER DANCE 


how author lists come together when he 

found himself mediating a dispute. A 
postdoc and a graduate student each wanted 
to be listed as the first author on a study. 
“They both had a case,’ recalls Kosslyn. “It got 
heated.” 

Disagreements often happen when con- 
tributors put in similar amounts of effort on 
different aspects of a project, says Kosslyn, a 
psychologist at Stanford University in Cali- 
fornia. For example, one person might have 
developed the idea for the project and the other 
performed most of the data analysis. “The 
force of the dispute usually revolves around 
the feeling that whatever they did was more 


Sis Kosslyn first started to consider 


important than what the other person did,” 
says Kosslyn. 

Such disputes are common. “As authorship 
is our academic currency, it tends to bea hot- 
button topic,” says Karen Peterson, scientific 
ombudsman at the Fred Hutchinson Cancer 
Research Center in Seattle, Washington. She 
says that one-fifth of the disputes she adjudi- 
cates concern authorship. Similar conflicts are 
among the most common issues mediated by 
the Committee on Publication Ethics (COPE), 
says Virginia Barbour, the organization's chair- 
woman and chief editor of PLoS Medicine in 
Cambridge, UK. 

Authorship disagreements can be miti- 
gated with careful discussions, explicit lab 
guidelines and a good understanding of 
authorship practices in one’s field. There is no 


perfect approach, but deciding on who gets an 
authorship credit, and how they are ranked, is a 
crucial part of doing science responsibly. 

Precise statistics on authorship disputes 
are hard to come by, says Mario Biagioli, a 
science historian at the University of Cali- 
fornia, Davis, who has studied authorship. 
Scientists may be reluctant to admit that they 
have demanded undeserved authorship or 
otherwise subverted the system, and the US 
Office of Research Integrity does not track 
such disagreements because they are not con- 
sidered scientific misconduct, says Biagioli, 
who co-edited the book Scientific Authorship: 
Credit and Intellectual Property in Science 
(Routledge, 2002). However, ina 2005 survey’ 
of researchers who had received a grant from 
the US National Institutes of Health (NIH), 
10% of respondents admitted to assigning 
authorship “inappropriately”. 


CREDIT CONFUSION 

Questions of who deserves credit for a paper 
area fairly recent phenomenon, says Biagioli. 
Once upon a time, a paper had one author, 
maybe two. But with modern big science and 
large collaborations, a study might have hun- 
dreds or even thousands of authors — as in the 
case of the ATLAS experiment’ at the Large 
Hadron Collider at CERN, Europe's particle- 
physics laboratory near Geneva, Switzerland. 

And what authorship means varies by 
scientific discipline. For example, in par- 
ticle physics, hundreds of researchers may 
contribute to the development and mainte- 
nance of a single piece of equipment, such 
as an accelerator. At big physics labs such as 
CERN, everyone who was working at the lab 
when the discovery was made gets a slot on 
the author list — even if they haven't seen the 
paper, says Biagioli. The authors are usually 
listed alphabetically, regardless of how much 
they contributed. 

In the biological sciences, by contrast, the 
author list is often strictly ranked. The top 
spotis at the end of the list, where the principal 
investigator gets credit for running the lab. The 
student or postdoc who actually did the work 
goes first. As for the authors in the middle, it is 
hard to tell whether they participated a lot ora 
little, says Biagioli. 

The International Committee of Medical 
Journal Editors (ICMJE), headquartered in 
Philadelphia, Pennsylvania, has developed 
authorship guidelines that are used by many 
journals and institutions. These rules state that 
to be listed as an author, each researcher > 


27 SEPTEMBER 2012 | VOL 489 | NATURE | 591 


© 2012 Macmillan Publishers Limited. All rights reserved 


> must meet three key criteria: they must 
have been involved in designing the project, 
collecting data or analysing the results; they 
must have participated in drafting or revising 
the manuscript; and they must have approved 
the final, published paper. Many universities 
that have their own guidelines base them on 
the ICMJE’s wording, says Biagioli. 

Kosslyn has his own definition: the crucial 
element, he says, is creativity. For example, a 
researcher could work with study participants 
in the lab, but just be following a protocol. 
“Anybody could have run the subjects, so run- 
ning the subjects is not enough,” says Kosslyn. 
To earn authorship, the researcher would be 
intellectually engaged: they might point out 
a feature of the data that leads the team to 
reshape the experiment. The paper wouldn't 
look the same without them. 


THE AUTHOR IN QUESTION 

COPE recommends that researchers decide 
who will be an author and what order they will 
be listed in before they even conduct experi- 
ments, and that the group revisits the author 
list as a project evolves. A handshake isn't 
enough to seal the deal — researchers should 
keep author agreements in writing. 

Whenever they occur, authorship discus- 
sions need not be confrontational (see Aggra- 
vation-free authorship’). Mark Groudine, 
deputy director of the Hutchinson Center, says 
that the parties in a dispute should sit down 
and try to talk the matter over. “People get 
so locked into their positions that they don't 
make the effort to understand the other per- 
son's point of view,” he says, “and therefore they 
don’t understand why it’s a dispute.” 

If talking doesn’t work, Groudine suggests 
asking the opinion of an unbiased third party. 
For example, on one project he collaborated 
with another principal investigator. When it 
came to writing up the paper, both wanted 


Ombudsman Karen Peterson says that one-fifth of 
the disputes she handles are about authorship. 


to be senior author. They invited two trusted 
colleagues to mediate. 

The jury awarded the senior slot to Grou- 
dine, but he felt uneasy about it. He suggested 
that the other investigator be the correspond- 
ing author, who communicates with the jour- 
nal and any scientists who enquire about the 
work. “I consider corresponding author as 
equivalent, almost, to senior author,” says 
Groudine. Co-senior authorship is also an 
option, he adds. 

But sharing credit too broadly can be risky. 
Sometimes authors are listed more as a cour- 
tesy than because they made a key contribution, 
says Chris Sneden, an astronomer at the Uni- 
versity of Texas at Austin, who will step down 
from his post as editor of The Astrophysical 
Journal Letters at the end of this year. Accepting 
courtesy authorship is a “double-edged sword’, 
he says. If the paper becomes famous, “every 
author gets to claim credit”. But if it becomes 
infamous, everyone gets a share of the blame. 
Researchers need to be aware of the potential 
risks of adding their names to manuscripts that 


CONFLICT RESOLUTION 


Aggravation-free authorship 


When many scientists work together, 
determining authorship isn’t always easy. 
Here are some tips for settling the line-up. 
@ Make sure that you choose collaborators 
with whom you can work well. 

@ Discuss authorship early, and keep doing 
so often as a project evolves. Put it in writing. 
@ When there are disputes, first try to talk 

it out amicably and understand the other 
person’s point of view. For example, try to 
work out how the idea first came about. 

@ lf you must approach your supervisor 
about an authorship decision that you 
don’t like, keep the tone inquisitive, not 
accusatory. Explain that you want to 
understand how authorship was decided. 


@ lf acontributor’s authorship is in question, 
it can help to consider what the paper would 
have looked like without their efforts, and 
whether someone else could have made the 
same contribution. 

@ Familiarize yourself with your institution’s 
or journal’s authorship guidelines, or those 
of the International Committee of Medical 
Journal Editors. Use them to back up your 
case. 

@ Be prepared to compromise or share 
credit. 

@ If you can’t agree among yourselves, 
engage a supervisor, trusted colleagues or 
an ombudsman to investigate the matter 
and make a recommendation. A.D. 
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they know little about (see ‘Ghosts and guests’). 

Gerald Schatten, a stem-cell researcher at 
the University of Pittsburgh in Pennsylvania, 
learned that lesson when he lent his good name 
to a high-profile but eventually discredited 
stem-cell paper by Woo Suk Hwang, then at 
Seoul National University. Schatten was inves- 
tigated by his university, which cleared him of 
misconduct, but chastised him for ‘research 
misbehaviour’ because he failed to check the 
quality of the science’. 

The decision to accept courtesy authorship 
is a matter of preference, says Sneden. “Per- 
sonally, if I haven't actually contributed some- 
thing to the specific paper, I just won't have my 
name on it,” he says. In that case, he politely 
tells his colleagues that he shouldn't be on the 
list. “I make sure they understand that it’s not a 
negative reflection on the paper,’ he says. 


TAKEN IN VAIN 

Sometimes, the recipient of this courtesy may 
not get the chance to bow out. A researcher 
who has been added to the author list with- 
out their permission might be surprised to see 
their name when the paper comes out, says 
Sneden, or even angry if they don't agree with 
the conclusions. Those who find themselves 
unexpectedly an author on a paper that they 
would prefer not to be associated with should 
contact the editor of the journal, he recom- 
mends. The editor will get in touch with the 
study’s corresponding author, and decide 
whether a corrigendum is necessary to explain 
that the author in question was not involved 
with the work. 

These kinds of conflicts shouldn't occur. 
Corresponding authors are expected to have 
the approval of their co-authors — but some 
dont realize it. “People, do you read the publi- 
cation agreement that you sign?” Sneden asks 
his colleagues. (Often, the answer is no.) 

Increasingly, journals are attempting to keep 
authors in line by asking for details on who 
did what. In cases of fraud, those descriptions 
should lay the blame at the right person's door. 

Biagioli agrees that delineating each per- 
son’s contribution should help, but he says 
that the descriptions are frequently too brief. 
As an example, he cites the study published 
this month in Nature by the ENCODE Pro- 
ject Consortium’. It ascribes generic tasks 
such as “data analysis’, “writing” or “scien- 
tific management” to large sets of authors, 
making it impossible to tell, for example, 
who analysed which data. When scientists sit 
down to plan a project — and ideally draft the 
author list — they should also decide how to 
describe everyone's contributions, says Biagi- 
oli. The relevant details will probably vary by 
discipline, he adds. 

In his own lab, Kosslyn has instituted a 
scheme to make authorship requirements 
explicit from the outset. As he listened to his 
student and postdoc arguing their cases several 
years ago, he started to develop what eventually 
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COMMERCIAL PAPERS 


Ghosts and guests 


Authorship can be misused when there 
is money to be made. Medical journals 
contain a mixture of original scientific 
findings and veiled advertisements for 
drugs, and scientists and physicians 
must read papers critically to understand 
a medicine’s true merits, says Alastair 
Matheson, a biomedical-research 
consultant in Toronto, Canada. 

Some pharmaceutical companies make 
drugs and run clinical trials, then engage 
medical writers to draft manuscripts. 
These contributors are often ghostwriters 
not listed as authors on the paper. Instead, 
the company’s marketing team finds 
a big academic name to headline the 
project — even if this guest author makes 
no contribution to the paper apart from 
scanning the final version. Companies 
sometimes use the same technique to 
produce reviews promoting their latest 
medicines, says Joseph Ross, a physician 
who studies health policy at Yale University 
in New Haven, Connecticut. One survey? 
found that guests and ghosts haunted 
21% of papers published in six leading 
medical journals in 2008. 

“This vast production line of information 
about drugs is passed off as the work 
of academics rather than the work of 
industry,” says Matheson. The companies 
get to advertise their products; the 
ghostwriters receive a pay cheque; and 
the academics get another line on their 
CVs. But the patients and the integrity of 
science all lose out, says Matheson. 

For example, Merck, a pharmaceutical 
company based in Whitehouse Station, 
New Jersey, minimized reporting of the 
risks observed for its painkiller Vioxx 
(rofecoxib) until the drug was taken 
off the market in 2004. Ross was a 


became a 1,000-point system. The research- 
ers who come up with the idea get 250 points, 
split between them according to their contri- 
bution; writing the paper is worth the same. 
A further 500 points are available for design- 
ing and running the experiment and analys- 
ing the data. Researchers who score at least 
100 points make the author list, with each 
persons point total determining their rank. 

Disagreements still occur; in those cases, 
Kosslyn decides how the points are allocated. 
When the balance of contributions is unclear, 
he does his best. However, it rarely comes to 
tallying points. “Usually it’s very obvious 
what the order’s going to be,” he says. 

In recent years, no disputes have ever risen 
to the level of the argument that led to the 


consultant to people who had taken 

Vioxx and developed heart problems, or 
their families, in two court cases against 
Merck, and he saw some of the company’s 
internal documents*®. “We were sort of 
shocked to find pretty rampant evidence 
that a lot of the trials were ghostwritten,” 
says Ross. “We would stumble across a 
full draft of a manuscript that just said, 
‘external author?”.” 

There are ways to identify traces of 
guests and ghosts in a manuscript: “Check 
the small print,” says Matheson. That is 
where a medical writer or communications 
company may be acknowledged. Funding 
from a drug-maker is another tell-tale sign. 
“These are pointers to the likelihood that 
this is something originated and planned 
by industry prior to the involvement of 
the headline authors,” says Matheson. 
Author disclosures are less helpful, he 
adds, because academic authors may list 
several affiliations and it is difficult to tell 
which commercial relationship is relevant. 

With commerce and medicine intimately 
intertwined, it would be impractical for 
academics to cut ties with companies, 
says Matheson. But, he adds, when 
academics are offered guest authorship, 

“| would advise them, for the sake of their 
reputation, to do two things”. First, he says, 
be more than a guest: make sure that your 
contribution is author-level. Second, insist 
that company employees involved in the 
study are also listed as authors. 

Matheson says it is the responsibility 
of journals to make participation by drug- 
makers more apparent. He would like to 
see papers marked right at the top with 
‘commercial article’. He also suggests that 
journals use labels to indicate who funded 
the study, and what drug it supports. A.D. 


point system. “That,” says Kosslyn, “was the 
last heated dispute we had in the lab?” mSEE 
WORLD VIEW P.475 


Amber Dance is a freelance science writer in 
Los Angeles, California. 
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POSTGRADUATES 


Career-planning course 


The University of Pittsburgh in 
Pennsylvania has launched a course on 
career planning for graduate students, 
one of the first to offer degree credits. 
Planning for Scientific Success aims to 
help students to identify and develop skills 
based on their interests and values, and to 
create a lifelong career-development plan. 
Steven Wendell, a molecular biologist 
and assistant director of the postdoctoral 
office at the university, proposed the 
course. “The career problems I hear from 
graduate students and postdocs are based 
in their lack of a clear, authentic career 
vision,’ he says. The course lasts for two 
semesters and is required for oral-biology 
graduate students at the University of 
Pittsburgh dental school, but is open to all 
graduate students at the university. Each 
semester is worth one credit. 


GRADUATES 
Trouble with tracking 


Universities across Europe want to 
improve how they track graduates’ career 
progression, says the European University 
Association (EUA) in Brussels. In Tracking 
Learners and Graduates’ Progression Paths, 
published on 13 September, the EUA finds 
that if institutions follow career outcomes, 
they can take steps to improve them, 

such as revising curricula or establishing 
strategies to improve communication 
skills. But of 23 institutions surveyed, 

77% did not systematically track PhD- 
holders’ careers. Study co-author Michael 
Gaebel, head of higher-education policy 

at the EUA, says institutions should 

create a student database and team up to 
standardize data collection. 


CITATIONS 
Nobel prizes predicted 


On 19 September, Thomson Reuters 
announced its annual ‘citation laureates’, 
whom it deems likely to win a Nobel prize. 
Since 2002, 26 of the predictions have 
come true. “We're trying to demonstrate 
that there is a strong correlation between 
citation at high frequency and peer esteem 
in science,’ says David Pendlebury, lead 
analyst for citation-laureate selection 
based in Eugene, Oregon. Each year, 
Reuters chooses up to nine candidates in 
each of the fields of chemistry, economics, 
physics and medicine. The 2012 laureates 
include researchers in genetic regulation, 
quantum teleportation and reducing the 
speed of light. 
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Ua SCIENCE FICTION 


BY GEORGE ZEBROWSKI 


eo Wilson sat by the window and 
Lees the insects passing through 
the glass. 

It was an illusion of some sort, he told 
himself, unblinking to catch how it was 
done; a love of insects was fooling him. 
Dreaming, he had to wake up. 

But they went through, unstoppable solid 
ghosts — mosquitoes, ladybirds, a bee and 
even a wasp; the mix made no sense, as if 
each kind of insect was showing off. 

He watched sadly as the bugs came 
through and stuck to the sticky strips 
he had hung nearby to keep a record. 

He touched the glass and drummed. 
on it with his fingers, once again test- 
ing its solidity; the charge in the mass 
of his fingers, which produced the 
experience of solidity, was certainly 
related to what the insects were able to 
do. They knew the physics and could 
alter the well defined illusion of solid- 
ity by varying the resistance. Absolute 
solidity, non-atomistic matter, closely 
packed to forbid all space-like fields, 
did not exist. Nature had been too 
cheap to make an all-solid Universe. 
Ever parsimonious, it had inflated a 
pattern with space, not unlike Styrofoam. All 
the mass of Earth might be compressed into 
a thimble, an entire universe into a single 
point... 

No, what the insects were doing was not a 
miraculous violation of natural law. But how 
were the bugs doing it? And why were they 
doing it? 

Another group arrived at the window, 
passed through, and flew into his stickies. 
He stared in wonder, thinking that he might 
be helping them by watching, deluding him- 
self into seeing the impossible-possible. 

He checked the stickies again. Three were 
full now with writhing bodies struggling to 
break free of the glue. 

He turned back to the window and saw 
with shock that the window was now cov- 
ered with the bodies of dead insects that had 
failed to pass through — and feared that his 
turning away had caused the catastrophe. 

To come through, they needed him. 

He sat and waited for another group, 
thinking of the very small statistical chance 
that he might walk through a wall; quantum 
theory admitted as much. 

The insects, some of them, had found that 
moment. 
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OLIDARITY 


A passing thought. 


And some had not. 

Video would not do to witness the 
extraordinary event. 

But just before he called in observers, he 
went outside and pushed a finger through 
the glass, then his whole hand, but resisted 
the urge to push through with his tongue. 

Finally, he stood on tiptoes and went 


through with his head, and felt a tingle as he 
withdrew, sure now that all it took was belief, 
derived from the fact that he had seen it done. 
Being a patriotic sort, Leo convinced his 
government of the phenomenon. History 
did not need to record which governance, 
as it did not much matter after he walked 
through a wall for a few ministers. An advi- 
sory magician was sceptical until Leo took 
him by the hand and walked him through. 
Quantum probabilities had a lot to offer. 
Some months later, several heads of 
intelligence agencies went to lunch — a 
commonality among professionals in a world 
determined to quarrel instead of war, but this 
time each was asking the other what they 
were going to do about one anothers’ agents, 
the ones who were now stuck in walls. 
All the agency heads had the same prob- 
lem. They all knew Leo 
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walls and others had to be fed by hand to 
survive, the agencies called a secret meeting 
somewhere in the Mediterranean and asked 
Leo Wilson to attend. 

“What do you suggest?” they asked after 
telling him what had happened. 

“Never had that problem myself? he said. 
“T-ve forgotten I have doors in my house.” 

“You do it without... difficulties?” they 
asked. 

“Of course,” he told them. 

“What can you suggest to help us?” they 
asked. 

“Read to them about the reality of prob- 
abilities” he said. “Might loosen up their 
imaginations.” 

Tried. It did not work. 

“Ladies and gentlemen,” Leo said, 
“I don’t know what else to tell you. 
Insects are cooperative, and maybe 
they pool the energy of the hive.” It 
occurred to Leo that the insects that 
had gone splat on the glass were the ones 
with individual tendencies, not unlike 

agency spies. “Or maybe it’s something 
like ...” he started to say. 

“Like what, Mr Wilson?” they all 
asked at once, as if suspecting that he 
was holding back the vital answer. 

He sighed. “Like bird navigation. 
Our feathered friends have some- 

thing in their brains that enables them to 
go anywhere on the globe without our kind 
of GPS” 

“But you told us some of the insects failed 
to go through. Doesn't that suggest any- 
thing?” 

“Yes, but they didn’t hesitate and stick part 
way through the glass,” Leo said, thinking 
about degrees of cooperation; the spies were, 
after all, company types. 

“Help us, Mr Wilson, men are dying stuck 
in the walls!” 

“Dying?” Leo asked. 

“We have to feed these political embar- 
rassments. What do you know, what do the 
bugs know how to do that we don’t?” 

Leo looked around at the anxious faces, 
shrugged and said: “They do it better than 
we do.” m 


George Zebrowski is an award-winning 
novelist, story writer, essayist, editor and 
lecturer. He is the author of the novel 
Empties (Golden Gryphon Press) and the 
editor, with Gregory Benford, of Sentinels 
in Honor of Arthur C. Clarke (Hadley Rille 
Books). 
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it distressingly difficult to breathe owing to a 

combination of emphysema and chronic asthmatic 
bronchitis — a deadly duo known as chronic obstructive 
pulmonary disease (COPD). 

Smoking is the most common trigger for COPD, and we 
have mapped the incidence of smoking and the prevalence 
of COPD in the United States, a country with fine-grained 
data on both phenomena (page S2). But there is more to 
COPD than smoking. Evidence is emerging that some 
people might be genetically susceptible to the disease (S7), 
while researchers are picking apart the involvement of 
errant immune cells (S15). 

New treatments for COPD are in development. Hot areas 
of pursuit involve combinations of agents, as well as novel 
drug-delivery systems (S16). Most available treatments only 
abate COPD’s debilitating flare-ups. One new line of enquiry 
is to stimulate the production of antioxidants to neutralize the 
free radicals that trigger the violent biochemical cascades that 
lead to the lungs’ deterioration (S4). If the damage wrought 
is irreversible, artificial lungs might be an option (S12). And 
there are tantalizing early results that suggest that vitamin D 
supplements can at least slow the lungs’ deterioration (S10). 

No remedy for COPD will be effective if the condition 
is not properly diagnosed. Indeed, too often, the disease is 
mistaken for other, less serious ailments, and the standard 
doctor's office test is prone to error (S8). 

While the decline in smoking in the West should lower 
the prevalence of COPD, other parts of the world have a 
rougher road to travel. COPD is expected to grow rapidly in 
China where tobacco use, smog and smoky cooking stoves 
portend a future of difficult breathing (S18). 

We acknowledge the financial support of Novartis Pharma 
AG in producing this Outlook. As always, Nature has full 
responsibility for all editorial content. 
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HEALTH IMPACT 


Breathless 


COPDis one of the world’s biggest killers, but awareness is 
low, diagnosis is often missed, and in many countries 
the extent of the problem is not even well-documented. 


BY AMBER DANCE 


orldwide, 65 million people with 
chronic obstructive pulmonary 
disease (COPD) are gasping for 


air — and with the World Health Organiza- 
tion (WHO) predicting the disease will vault 
from fifth to third in the leading causes of 
death globally by 2030, their breathless ranks 
are set to swell. Governments, scientists and 
pharmaceutical companies are taking several 
approaches to ease the burden of COPD, both 
on the patients and on healthcare systems. 
COPD encompasses both emphysema and 
chronic asthmatic bronchitis; patients can suf- 
fer either problem or both simultaneously. 
Emphysema eats away at the lung’s air sacs, or 
alveoli, so that less surface area is available for 
gas exchange. Bronchitis constricts the airways 
entering the lungs and clogs them with mucus. 
The winded masses face a disease that is incur- 
able, even for those fortunate enough to receive 


a proper diagnosis and treatment for symptoms. 

Surveys suggest that COPD is grossly under- 
diagnosed in both developed and developing 
nations. People don't always report the relevant 
clinical signs, and some doctors do not use 
spirometry, the gold-standard method of 
diagnosis, because they cannot afford the 
machine or haven't heard of it. A patient undi- 
agnosed means a patient untreated, which 
could hasten death. 

Although reliable numbers on global COPD 
rates are hard to come by, the WHO estimates 
that 90% of COPD cases occur in developing 
nations. COPD is likely to become an especially 
serious problem over the coming decades in 
China, the world’s biggest producer and con- 
sumer of cigarettes. About a third of Chinese 
people smoke, among the highest rates in the 
world. Smoking-related diseases are the coun- 
try’s biggest killer, and COPD alone kills nearly 
two million people there each year. 

Smoking is the major risk factor for COPD, 
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but it is not the only one. Just 15% of smokers 
develop the disease. Presumably, their genetic 
makeup predisposes them to lung maladies. 
Scientists have known for decades that a 
mutation in the enzyme al-antitrypsin, found 
in 1-2% of people with COPD, puts people at 
greater risk — especially smokers. In the past few 
years, genome-wide screens have netted a slew of 
new gene candidates, though not all have been 
confirmed as risk factors. 

The manner in which smoke particles dam- 
age the lungs also remains uncertain. In addition 
to imbalances in enzymes such as antitrypsin, 
another potential culprit is the immune system. 
Most smokers suffer from inflamed airways. 
But those who develop COPD endure sustained 
inflammation, even after they quit smoking. 
Some scientists see this as evidence that COPD is 
an autoimmune disease. Supporting the hypoth- 
esis, researchers have found that some people 
with COPD possess antibodies against some of 
their own proteins. 

Another potential modulator of the immune 
response in COPD is vitamin D. Several studies 
have shown that people with COPD are more 
likely to have a deficiency in the vitamin, and 
the less vitamin D they have, the worse their 
lungs work. Vitamin D may squelch the nega- 
tive inflammatory response while promoting 
beneficial immunity, but researchers have yet to 
confirm its effect and work out the details. 

Pharmaceutical companies are pursuing new 
and improved versions of current COPD treat- 
ments, such as the bronchodilators that relax air- 
way muscles and ease breathing. Both Novartis, 
headquartered in Basel, Switzerland, and Pearl 
Therapeutics, headquartered in Redwood City, 
California, report promising data. And a col- 
laboration between GlaxoSmithKline (GSK), 
headquartered in Brentford, UK, and Thera- 
vance, a biopharmaceutical company based 
in South San Francisco, California, is running 
several trials with a dual-drug combination. 
But another combined therapy from GSK and 
Theravance, Relovair, has yielded mixed results 
thus far. Fortunately, drugs are not the only 
option. Researchers are also at work on bioarti- 
ficial lungs that they hope could reduce or even 
eliminate the need for donor organs. 

Cigarette smoke is packed with free radicals, so 
antioxidants — which can neutralize their dam- 
aging effects — have also been tested in clinical 
trials. The results have been inconsistent, perhaps 
because each medicine only mops up a subset of 
the different types of free radicals ravaging the 
lungs. Some scientists have now turned their 
attention to Nrf2, a DNA-binding protein that 
switches on many of the body’s own antioxidants. 

These treatments will be important weapons 
in the fight against COPD. But most are still con- 
fined to the lab rather than nearing the clinic — 
and scientists and health professionals must take 
aim at COPD now if they are to stem its rise. m 


Amber Dance is a freelance science writer 
based in Los Angeles, California. 
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COPD IN THE UNITED STATES 


INSIDE THE LUNGS 


It is not a single disease but a constellation of symptoms. 
And comes in two forms: emphysema and chronic bronchitis. 
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Patients with COPD suffer primarily because their alveoli, 

or air sacs, lose their elastic quality or are destroyed. The 

airways may also become thick and stiff, inflamed, or clogged 

with mucus, so that patients can take only shallow breaths. A person dies of 
COPD-related illness 

every four to five minutes 
in the United States. 


SMOKE SCREEN 


A DISEASE ON THE RISE 


Of the ten most common causes of death in the United States, 
COPD is the only one with an increasing death rate. 
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The rise was fuelled partly by a sharp increase of COPD among 
women, who in 2000 became more likely than men to die of the 
disease owing to several trends including more women smoking at 
a younger age and doctors’ increasing awareness of the syndrome. 


Tobacco smoking is the primary risk factor for COPD in industrialized countries. This map shows the prevalence 


of smokers in the United States (in colour) and the prevalence of COPD rates (by elevation). 


Other risk factors, 
including age and air 
pollution, can cause 
certain counties with 
low rates of smoking 
to have a high 
burden of COPD. 
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The mortality rate among native Alaskan people 
is almost double that of white US citizens. 


Smoking and 
air pollution are 
both risk factors 
for COPD in 
Appalachia. 


COPD rates 


Counties are elevated by 
COPD prevalence 


Smoking rates 
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Data for ‘A disease on the rise’ came from the American 
Lung Association. Data for the map came from the American 
Lung Association and the CDC’s Behavioural Risk Factor 
Surveillance System. Map illustrated by Chris Wilson. 


27 SEPTEMBER 2012 | VOL 489 | NATURE | S3 


— 


ft. 


Immune cells taken from the lung seen expressing the protein Nrf2 (brown) which acts as a master switch for genes that encode protective antioxidants. 


A radical treatment 


Researchers are counting on drugs that activate a master switch for antioxidant genes to 
protect lung tissue of COPD patients from an onslaught of free radicals. 


BY KEN GARBER 


chronic obstructive pulmonary disease 

(COPD), a progressive condition charac- 
terized by lung damage, a narrowing of the air- 
ways and difficulty breathing. No surprise given 
that each puff of cigarette smoke contains more 
than 10”° free radicals — atoms or molecules 
with unpaired electrons that can react violently 
with other molecules, setting off chain reac- 
tions that damage proteins, lipids, and DNA. “In 
excess, [free radicals] will cause injury, and will 
cause impairment in the repair process,’ says 
biochemist Irfan Rahman of the University of 
Rochester in New York. “That’s what happens 
in lungs in response to tobacco smoke” 

COPD continues to progress long after a 
person quits smoking, and free radicals are 
partly to blame. COPD involves chronic 
inflammation, and inflammatory cells release 
abundant free radicals. As a result, the lungs of 
COPD patients exist in a constant state of oxi- 
dative stress — an imbalance of free radicals 


moking is the most common cause of 


and antioxidants. Antioxidant enzymes convert 
free radicals to less reactive molecules, and anti- 
oxidant free radical scavengers donate electrons 
to halt free radical chain reactions. In COPD, the 
shower of free radicals overwhelms these anti- 
oxidants, causing cell death and tissue damage. 

But clinical trials of antioxidants have mostly 
failed to prove they work as a COPD treatment. 
That may be because each chemical antioxidant 
molecule can extinguish only one target mol- 
ecule; it is therefore impossible to quench more 
than a fraction of the excess free radicals even 
using high doses of antioxidants. Such anti- 
oxidants behave in a “sacrificial” manner, says 
Paul Kirkham, a biochemist at Imperial Col- 
lege London. “Once it’s been used, it’s gone.” 
Another problem, says Kirkham, is that indi- 
vidual antioxidants may not reach the cellular 
compartment where they are needed most. 

A new approach to COPD promises to 
solve these problems. Several drug companies 
are developing compounds that activate the 
DNA-binding protein Nrf2 (nuclear erythroid- 
related factor 2, also known as Nfe212), which 
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acts as a master switch of genes that encode 
antioxidants. Nrf2 emerged from obscurity in 
1997, when biochemist Masayuki Yamamoto 
at the University of Tsukuba in Japan showed 
that it activates an entire class of detoxifying 
enzymes. Among these enzymes are many that 
generate critical antioxidants. 

Drugs that activate Nrf2, in theory, would 
solve the problems of dose and compart- 
mentalization that have so far derailed 
other antioxidant therapies. To begin with, 
Nrf2-activated enzymes aren't spent each 
time they do their job. “They can effectively 
regenerate themselves — that’s the beauty of 
an enzyme,’ says Kirkham. “So you need much 
less of it Together, these enzymes neutralize 
a variety of free radical molecules, not just 
one, and in all important cell compartments. 
Finally, Nrf2 induces the expression of proteins 
that can repair some of the damage inflicted by 
free radicals. 

Researchers have been studying Nrf2 in ani- 
mal models of COPD for more than a decade, 
and in human tissue. It now appears that Nrf2 
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activators may potentially benefit 
COPD patients beyond trigger- 
ing antioxidant and detoxification 
enzymes. Following Yamamotos 
discovery, researchers assumed 
that Nrf2 activators would need 
to be administered early in the 
disease — in particular, before 
the lung tissue destruction typi- 
cal of emphysema, one of the two 
main manifestations of COPD. 
(The other is bronchitis, inflam- 
mation of the bronchi, the main 
airways that go into the lungs.) 
But work by toxicologist Shyam 
Biswal at the Johns Hopkins Uni- 
versity School of Public Health in 
Baltimore, Maryland, and others 
suggests that Nrf2 activators might 
be effective not only in preventing 
COPD progression but in treat- 
ing advanced cases. And, unlike 
bronchodilator drugs, says Biswal, 
Nrf2 activators might not just treat 
the disease’s symptoms but also 
arrest its course. 

In 2004, Biswal’s group found 
that mice genetically engineered 
to lack Nrf2 developed early- 
onset emphysema with more 
severe inflammation than did wild-type mice, 
and that this was a result of oxidative stress’. 
Moreover, exposing the mice without Nrf2 to 
cigarette smoke over six months caused com- 
paratively more lung damage. Four years later, 
three different research groups reported that 
Nrf2 activity declines as COPD progresses 
in humans — suggesting some protective 
role for Nrf2. Then Biswal’s group found that 
lack of Nrf2 activity in human COPD lungs 
caused defective protein clearance, which led 
in turn to more oxidative stress and cell death. 
Adding an Nrf2 activator to cells prevented 
these effects. The overall picture emerging 
from these studies is that Nrf2 is a key stress- 
response factor whose absence worsens COPD. 

Nrf2 may also help clear harmful bact- 
eria from the lungs of COPD patients. Such 
bacteria can cause COPD exacerbations — 
acute and potentially fatal bouts of increased 
coughing, mucus production and shortness 
of breath. Biswal’s group found that adding an 
Nrf2 activator to cultures of macrophages (cru- 
cial host defense cells) taken from the lungs 
of COPD patients restored the cells’ ability to 
clear bacteria in culture and in mice’. Biswal’s 
group went on to show that Nrf2 in mac- 
rophages activates a scavenger receptor that 
recognizes pathogenic bacteria and enables 
the macrophages to clear infection — thus, in 
theory, limiting COPD exacerbations. 

Finally, Nrf2 plays a crucial role in overcom- 
ing treatment resistance to corticosteroids, a 
mainstay in asthma therapy, which doctors 
often prescribe to COPD patients. Unfortun- 
ately these steroid hormones provide little 


Cells in 
the lung 
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Cigarette smoke and other toxins cause lung cells to activate 
transcription factors NF-«B and Nrf2. In the nucleus, NF-«B 
promotes inflammation whereas Nrf2 turns on antioxidant 
genes and dampens inflammation. 
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or no therapeutic benefit in COPD. In fact, 
says Biswal, “steroids cause more problems 
in COPD than they help,” mainly because 
steroid treatment can lead to pneumonia. An 
explanation for this corticosteroid resistance 
emerged in 2005, when pulmonologist Peter 
Barnes at Imperial College London reported 
that oxidants in cigarette smoke inactivated 
the enzyme histone deacetylase 2 (HDAC2), 
which normally blocks the expression of 
inflammatory genes. Corticosteroids reduce 
inflammation by signalling through a receptor 
that recruits HDAC2 and represses inflamma- 

tory gene expression. 

But in smokers and 


“You've got to COPD patients, 
do the clinical oxidative stress 
trial and then inactivates HDAC2 
see what resulting in continu- 
happens.” ous inflammation and 


tissue damage. 

By reducing oxidative stress, Nrf2 acti- 
vators can reverse corticosteroid resist- 
ance, at least in cell culture. Biswal and his 
Johns Hopkins colleague Robert Wise are 
planning a clinical trial of treating COPD 
patients with the steroid prednisone and 
the Nrf2 activator sulforaphane. “This 
would be a logical approach,” says Barnes. 
Sulforaphane is already in a phase II 
clinical trial for COPD, sponsored by the US 
National Institutes of Health. Cells extracted 
from the lungs of COPD patients will be tested 
to see if the drug raises levels of Nrf2. 

But sulforaphane is not an ideal Nrf2 acti- 
vator. It does activate Nrf2, but not always 
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very potently, and because it tar- 
gets many other proteins it could 
potentially cause collateral dam- 
age. And although safe at low 
doses (it is derived from broc- 
coli sprouts), sulforaphane can 
be toxic at high doses. Hence an 
intense search for more selective 
Nrf2 activators is underway. 

Drug companies are already 
heavily involved. In December 
2011, in one of the largest pre- 
clinical deals ever, Abbott Labo- 
ratories, headquartered in Abbott 
Park, Illinois, agreed to pay 
US$400 million to Reata Pharma- 
ceuticals, based in Irving, Texas, 
to license Reata’s second-gen- 
eration Nrf2 activators. Abbott 
executive vice president Thomas 
Freyman told investors in January 
2012 that he expected the first of 
those compounds to enter clinical 
trials later in the year. Meanwhile, 
Cureveda, a Baltimore, Maryland, 
biotech firm cofounded in 2010 
by Shyam Biswal, is screening 
compound libraries from the UK 
pharmaceutical company Glaxo- 
SmithKline for pro-Nrf2 activity. 
Other companies are interested in targeting 
Nrf2, including Pfizer and Novartis, which 
are part of a consortium studying new thera- 
peutic strategies for treating COPD, according 
to Kirkham. 

Despite their great potential in COPD, Nrf2- 
activating drugs present possible safety issues. 
Nrf2 is usually inactive in cells, but is activated 
by spikes in oxidative stress. Using drugs to 
keep Nrf2 turned on could disrupt beneficial 
oxidative processes in cells. “The cell is actu- 
ally maintained ina very fine redox [oxidation- 
reduction] balance, because it actually needs 
oxidative stress to do some of its signaling,” 
says Kirkham. Biswal concedes that continu- 
ous activation of Nrf2 could cause side effects. 
“You've got to do the clinical trial and then see 
what happens,” he says. 

Then there is the evidence that Nrf2 might 
increase the risk of cancer. In 2008, a group 
at Japan’s National Cancer Research Institute 
in Tokyo reported apparent Nrf2-activating 
mutations in human lung tumours and in head 
and neck tumours’. In their report, they con- 
tend that ongoing activation of Nrf2 might give 
cancer cells “undue protection from their 
inherently stressed microenvironment.” In 
2011, two groups reported that in mouse 
models Nrf2 is indirectly activated by 
mutations that cause hereditary kid- 
ney cancer in humans. And a group from 
the Cambridge Research Institute, part 
of Cancer Research UK, found that three 
common cancer-causing genes acti- 
vate Nrf2, which may promote tumor- 
igenesis by reducing oxidative stress in 
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pre-cancerous cells’. 

But, paradoxically, activated Nrf2 can also 
have anticancer effects. Several studies have 
found that Nrf2-activating compounds can 
prevent or suppress cancer in mouse models. 
Anda recent study found that sulforaphane did 
not promote lung cancer in mice. Pharmacolo- 
gists Mike Sporn and Karen Liby at Dartmouth 
College in Hanover, New Hampshire, explain 
that Nrf2 can either promote or suppress can- 
cer depending on the cellular context’. 

Safety concerns about Nrf2 activators in 
COPD have been lessened by trials of two 
Nrf2-activating drugs in other diseases. 
Weston, Massachusetts-based Biogen Idec 
recently completed phase III clinical trials 
of dimethyl fumarate (DMF) for multiple 


What exactly causes the destruction of 

lung tissue in emphysema? Free radicals 
definitely contribute, but for half a century, 
the prevailing hypothesis has been that 
emphysema, one of the two main aspects 
of COPD, develops when the activity of 
proteases — enzymes that digests proteins 
— falls out of balance with antiprotease 
activity. Proteases normally help maintain 
lung health by clearing roadblocks from 

the path of migrating immune cells hunting 
pathogenic viruses and bacteria. But 
cigarette smoke stimulates immune cells 
such as macrophages and neutrophils to 
release too many proteases, and wanton 
destruction of proteins ensues. Mysteriously 
and inexorably, this process continues in 
COPD patients even after they stop smoking, 
as emphysema slowly suffocates them. 

That mystery is now beginning to be 
solved, as clues implicate autoimmunity. 
Th17 cells — a specialized class of CD4* 

T cells, the cell type that orchestrates the 
adaptive immune response — appear to 
be central to the pathology. Therefore, 
therapies targeting these cells might be a 
promising strategy. 

Early evidence for autoimmunity in COPD 
came in 2007, when Farrah Kheradmand, a 
pulmonologist and immunologist at Baylor 
College of Medicine in Houston, Texas, isolated 
CD4* T cells from the blood of ex-smokers 
with COPD®. Elastin is an important structural 
protein in the lungs’ connective tissue, and 
when Kheradmand added elastin fragments 
to these T-cell cultures, the T cells responded 
by releasing inflammatory cytokines as 
if the cells had encountered a microbial 
pathogen. “A large number of patients who 
have emphysema [who] stop smoking and 
continue to have the disease will have these 


sclerosis. Although DMF wasn’t designed 
as an Nrf2 activator, the company claims it 
works at least partly that way. And Abbott 
and Reata are jointly developing bardoxolone 
methyl, another unintentional Nrf2 activa- 
tor that’s now in phase III trials for chronic 
kidney disease in type 2 diabetes. Side effects 
of bardoxolone methyl have been mild. Side 
effects reported for DMF include abdominal 
pain, diarrhea, flushing and headaches. On the 
other hand, it is too early to assess the risk of 
cancer — any cancers would likely take many 
years to appear. 

Clinical trials should soon determine if Nrf2 
can help resolve the many problems in COPD, 
from oxidative stress to inflammation. “COPD 
is a multifactorial disease — it’s one of the 


AUTOIMMUNITY 
The T-cell connection 


autoreactive T cells,’ says Kheradmand. 

Why the body attacks its own lung tissue 
remains unknown. But by isolating the cells 
driving this autoimmunity, scientists can 
identify potential drug targets. Th17 cells, 
which secrete the inflammatory cytokine 
IL-17, are the main drivers of autoimmunity 
in rodent models of rheumatoid arthritis, 
psoriasis and multiple sclerosis. Th17 cells 
are also present in COPD. In 2009, 
Kheradmand’s group found that culturing 
certain cells of emphysema patients with 
their own T cells drove those T cells to release 
IL-17, which in turn led to the production of 
destructive proteases’. 

Evidence for IL-17 driving autoimmunity 
in COPD continues to be found. In 2011, 
University of Pittsburgh, Pennsylvania, 
pulmonologist Jay Kolls reported that mice 
genetically lacking the IL-17 receptor did not 
develop emphysema despite six months of 
exposure to cigarette smoke’. More recently, 
Kheradmand and collaborators at the MD 
Anderson Cancer Center in Houston, Texas, 
genetically engineered mice to overexpress 
one form of IL-17 —IL-17A — and then 
exposed these mice to cigarette smoke 
for four months®. The mice developed an 
especially severe form of emphysema along 
with producing more destructive proteases. 
Both studies fingered IL-17 as a culprit in 
COPD autoimmunity. 

The picture emerging from these studies is 
that inhalation of cigarette smoke promotes 
the generation of T cells that express IL-17, 
which, in turn, causes the production of 
proteases that destroy lung tissue including 
elastin. Then, elastin fragments trigger an 
autoimmune response involving more IL-17- 
producing T cells, perpetuating a loop and 
more tissue destruction. 
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worst,’ says Biswal. Harnessing the power of 
Nrf2, that prolifically protective protein, may 
eventually prove to be the solution. m 


Ken Garber is a freelance science writer based 
in Ann Arbor, Michigan. 
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NO 


To disrupt this vicious cycle, pharma- 
ceutical and biotech companies could test 
drugs they already have in hand that target 
these cells and the cytokines they produce. 
Antibodies specific for IL-17 and its recep- 
tor have been very effective in clinical trials 
for psoriasis. As for COPD, “| don’t know if it 
is on the radar screen of these companies 
yet, but | think [IL-17] would be definitely an 
intriguing target,” says Kolls. He is collaborat- 
ing with an undisclosed company, testing 
whether anti-IL-17 receptor antibodies can 
treat COPD in mice. 

Others are more ambivalent. “It would 
certainly be worth studying IL-17 or IL-17 
receptor blockers,’ says Peter Barnes, a 
pulmonologist and COPD researcher at 
Imperial College London. He adds that such 
trials are planned. But he cautions that Th17 
cells may not play the same role in human 
COPD as in the mouse version of the disease; 
indeed, he says, the evidence for increased 
Th17 cells in human COPD lungs is “not as 
striking” as itis in mice. “The evidence for 
Th17 cells playing an important role in COPD 
is not fully established,” he adds. 

Kheradmand disagrees. She says that 
IL-17 is “very much present” in lung tissue 
of human COPD patients. And Kneradmand 
points out that each case of emphysema 
is different, with not all manifesting 
autoimmunity. Her lab at Baylor College 
is developing an assay to identify COPD 
patients with an autoimmune form of the 
disease driven by Th17 cells, and hopes 
companies will eventually test their drugs in 
such patients. “Wouldn’t it be wonderful,” she 
asks, “if we can just identify this particular 
individual... who does have this autoimmune 
component, to then offer him these 
biologics?” — K.G. 


PERSPECTIVE 


any people who develop chronic obstructive pulmonary dis- 
M ease (COPD) blame themselves rather than the tobacco indus- 

try, which continues to promote cigarette smoking though 
the dangers are well-known. This disturbing ‘blame and sham attitude 
among patients is based on one of the most well-established associations 
in clinical epidemiology: the causal relationship between cigarette smok- 
ing and COPD. Exposure to cooking stove smoke from biomass fuels is 
also an important COPD risk factor in some parts of the world. 

Although these environmental risk factors are strong, recent research 
suggests that genetics also plays a key role in COPD. Identifying genetic 
determinants and investigating their functions may lead to important 
progress in COPD pathobiology, diagnosis and treatment. It could also 
help patients understand that the disease is not their fault. 

Genetics provided one of the first clues regarding COPD pathogenesis. 
A small percentage of patients inherit severe al-antitrypsin deficiency 
(A1ATD), a well-characterized, rare syndrome that often includes COPD. 
The discovery of AlATD nearly 50 years ago led to the protease—antipro- 
tease hypothesis for COPD, which postulates 
that lung destruction results from an excess of 
protein-degrading enzymes relative to their 
enzyme inhibitors. This hypothesis remains 
important in current COPD pathobiology. 

Notall smokers are equally likely to develop 
COPD. An underlying susceptibility appears 
to run in families. Indeed, familial studies 
of severe, early-onset COPD patients with- 
out A1ATD suggest other genetic risk fac- 
tors. Smokers who are first-degree relatives of these subjects are about 
three times more likely to develop COPD than are smokers in general. 
Non-smokers who are first-degree relatives of these subjects are not at 
increased risk’, suggesting that genetic factors may interact with smoking. 

As with other complex diseases — those influenced by multiple genetic 
and environmental factors — many studies of candidate genes have failed 
to replicate. Yet subsequent genome-wide association studies (GWAS) 
have found four genomic regions associated with COPD that meet the 
stringent standard for statistical significance in genome-wide studies”. 

Two of these genomic regions, near the HHIP and FAM13A genes 
— the former a member of the developmentally essential hedgehog 
pathway, the latter a gene of unknown function — are also associated 
with variations in lung function levels in samples taken from the gen- 
eral population’. An association with lung function does not in and of 
itself prove susceptibility to COPD; genes that influence traits that vary 
among healthy people, such as height, are not always the same ones 
that influence pathological conditions, such as dwarfism. However, the 
associations of COPD to genomic regions near HHIP and FAM13A are 
convincing. They have been replicated in several studies, and a potential 
functional genetic variant that regulates HHIP gene expression has been 
found upstream from HHIP’. Further study of HHIP and FAM13A may 
identify new biological pathways involved in COPD. 

The other two regions identified by GWAS, on chromosome 15 and 
chromosome 19, include many genes of interest, including several related 
to nicotine addiction. The chromosome 19q region has been associated 
with smoking behaviour; it is the location of the gene CYP2A6, involved 


GENETICS COULD HELP 
PATIENTS REALIZE IT’S 
NOT THEIR FAULT 


How can genetics help? 


Smoking and COPD have one of the strongest relationships in clinical 
epidemiology. But don’t forget the genetics, says Edwin K. Silverman. 


in nicotine metabolism. Similarly, the chromosome 15q25 region, which 
contains genes for several components of the nicotinic acetylcholine 
receptor, has been convincingly related to smoking behaviour. The same 
region also contains the gene IREB2, which encodes an iron binding 
protein that has been potentially linked to COPD susceptibility. 
The overwhelming association between COPD and smoking provides 
a unique opportunity to understand the relationships between environ- 
mental and genetic influences on risk of disease. In most complex dis- 
eases, environmental factors are either unknown or difficult to measure. 
Cigarette smoking behaviour, on the other hand, can be accurately quan- 
tified. Of course, genes and the environment are intertwined; genetic 
determinants of nicotine addiction may influence exposure to COPD’s 
key environmental risk factor. Furthermore, the risk from smoking sug- 
gests that epigenetic factors may also influence COPD pathogenesis’. 
The next generation of genetic research on COPD will include studies 
of rare genetic variants assessed by sequencing or genotyping the vari- 
ants in patients’ exomes (the protein-coding portions of the genome) and 
ultimately by sequencing their entire genomes. 
Advances in computational biology and phe- 
notype characterization based on imaging 
and clinical observation will also be needed in 
coordination with genetic studies to dissect the 
COPD syndrome into groups of patients with 
different subtypes — an initiative with both 
diagnostic and therapeutic implications. COPD 
is highly heterogeneous, with variable amounts 
of emphysema and airway disease. Integrating 
genetic studies with chest computed tomography (CT) scans has already 
identified the BICD1 gene as a potential determinant of emphysema. 
However, comprehensive understanding of the complex pathobiol- 
ogy of COPD will probably require the integration of multiple -omics 
data types (for example, proteomics, transcriptomics and metabo- 
lomics) with detailed phenotypic assessment, epigenetics, and genetic 
variants using systems biology and network science approaches. 
Avoiding tobacco smoke will always be an essential public health 
message. But for millions already suffering, advances in genetics, 
pathophysiology and phenotyping may lead to new opportunities for 
specific diagnosis and personalized treatment of COPD. = 


Edwin K. Silverman is chief of the Channing Division of Network 
Medicine at Brigham and Women’s Hospital in Boston, Massachusetts, 
and associate professor of medicine at Harvard Medical School. 
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Impaired lung function, as measured by breathing apparatus, is a sign of COPD, but researchers are looking for easier and more reliable ways to diagnose disease. 


To catch a killer 


The first symptoms of COPD can be subtle, so the disease often goes undiagnosed. 
Researchers are searching for ways to detect the disease and to identify those most at risk. 


BY CASSANDRA WILLYARD 


breathing. His doctor diagnosed him with 

asthma, but asthma medications didn't seem 
to help. John’s non-identical twin brother, Fred, 
began having similar problems at about the same 
time. He too was told he had asthma, but asthma 
treatments didn't work for him either. 

For years, the two men struggled to find a way 
to alleviate their symptoms. Then, in 1989, John 
got a phone call from his brother. “He said, ‘I got 
good news and bad news,” John Walsh recalls. 
The good news was that the brothers, who were 
40 at the time, finally had a correct diagnosis. 
The bad news was that they hada genetic form of 
chronic obstructive pulmonary disease (COPD) 
— the same disorder that had killed their mother 
when they were 13 years old. 

The Walsh brothers havea rare mutation in the 
gene SERPINA 1 (present in 5% of people with 
COPD) that causes them to produce an abnor- 
mal version of a protein called a1-antitrypsin, 
which usually protects lung function. But the 
disease is common, killing millions of peo- 
ple each year, mainly as a result of exposure to 
tobacco smoke or airborne pollutants. 


[: 1985, John Walsh began having trouble 


According to the World Health Organization, 
65 million people worldwide have COPD, but 
the exact number is hard to pin down because 
many cases are not diagnosed. Data from the 
third National Health and Nutrition Exami- 
nation Survey in 2000 showed that 24 million 
people in the United States had impaired lung 
function that might indicate COPD, but less than 
halfhad received a diagnosis of COPD. 

Lack of awareness is part of the problem. 
COPD hasnt garnered the recognition that other 
chronic illnesses such as cardiovascular disease 
and cancer have, says Bartolome Celli, a doctor 
who specializes in lung diseases at Brigham and 
Women’s Hospital in Boston, Massachusetts. 

John Walsh — who is now president of the 
COPD Foundation, a non-profit organization 
based in Washington DC that advocates research 
and education — and health officials around the 
world are working to change that by launching 
massive awareness campaigns. Meanwhile, 
researchers are searching for new ways to iden- 
tify individuals with COPD. 


A SILENT EPIDEMIC 
COPD comes on slowly, typically in individu- 
als over the age of 50. The disease “sneaks up on 
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you’, says James Kiley, director of the division of 
lung diseases at the National Heart, Lung, and 
Blood Institute (NHLBI) in Bethesda, Maryland. 
People often attribute their symptoms to normal 
ageing or to being out of shape, and thus they fail 
to seek medical care. For smokers, there may also 
bea sense of “I did it to myself”; says Roger Gold- 
stein, a doctor who specializes in respiratory 
medicine at the University of Toronto in Canada. 

Because COPD symptoms resemble those of 
other conditions such as asthma, even patients 
who visit the doctor’s office might be incorrectly 
diagnosed. In the early stages of the disease, peo- 
ple complain of “vague and minor symptoms 
that the doctor doesn’t always associate with 
COPD’, says Leonard Fromer, a family doctor 
who specializes in lung diseases at the University 
of California, Los Angeles. “It could be some- 
thing like ‘I can't play three sets of tennis any- 
more. I get tired too quickly” In some cultures, 
the doctor might not even ask about patients’ 
smoking habits for fear that this might be 
“digging too much into people's private habits,’ 
says Anne Frolich, a doctor who studies chronic 
diseases at the University of Copenhagen. 

Even a textbook case of the disease may be 
missed if the patient doesnt fit the stereotypical 
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COPD profile. Doctors have long been trained 
that COPD is a disease of elderly men who 
smoke, says Fromer. In 2001, for example, a 
team of researchers presented 192 general prac- 
titioners in the United States and Canada with 
a hypothetical case description that was indica- 
tive of COPD (Chapman, K. et al. Chest 119, 
1691-1695; 2001). When the researchers said 
that the patient was a man, 58% of the doctors 
gave COPD as the most probable diagnosis. That 
dropped to 42% when the researchers said the 
patient was a woman. Yet, in the United States 
at least, more women than men are being diag- 
nosed now, and more women have died from 
COPD each year since 2000 — perhaps because 
women are biologically more susceptible to devel- 
oping the disease than men or perhaps because, 
as a group, they started smoking later than men. 
Among men and women, awareness of COPD 
is on the rise. In 2007, the NHLBI launched the 
Learn More Breathe Better campaign. And, in 
February 2010, an awareness campaign called 
DRIVE4COPD was launched by the German 
pharmaceutical company Boehringer Ingel- 
heim, which markets the widely used COPD 
medication Spiriva (tiotropium bromide) 
together with pharmaceutical giant Pfizer. 
Boehringer Ingelheim selected stock-car racer 
Danica Patrick as its celebrity spokesperson, 
and the DRIVE4COPD campaign, now led by 
the COPD Foundation, has screened nearly 2.5 
million people via an online questionnaire. Par- 
ticipants are encouraged to share the completed 
questionnaire with their doctor or other health- 
care professionals. These campaigns may be pay- 
ing off. A 2011 web-based survey by the NHLBI 
found that 71% of US adults say that they're 
aware of the disease, up from 65% in 2008. 


SEEKING THE SICK 
In addition to lack of awareness, another fac- 
tor that contributes to underdiagnosis is the 
diagnostic test itself. General practitioners and 
pulmonary specialists diagnose COPD based 
on a patient’s medical history, symptoms and 
alung function test called spirometry. The test 
requires the patient to blow into a tube as hard 
and fast as possible. A device then measures the 
total amount of air exhaled and the amount of 
air exhaled in 1 second. 

Spirometry is simple, but it is not flawless. 
If the patient accidentally fails to blow hard 
enough, the results may mean little. “Thirty 
to forty per cent [of the test results are] pretty 
much uninterpretable,” says David Mannino, 
an epidemiologist at the University of Kentucky 
in Lexington. Mannino is interested in devising 
alternative tests, such as one “where you see 
what happens to a person's blood oxygen as they 
hold their breath.” But this research is still in the 
earliest stages: Mannino is currently putting 
together grant proposals. Other researchers are 
beginning to look for diagnostic biomarkers in 
the blood, urine and sputum. A peptide called 
N-acetyl-proline-glycine-proline, a by-product 
of the breakdown of collagen, has been found 


in the sputum of COPD patients but not in the 
sputum of people without the disease. In addi- 
tion, the protein fibrinogen, a marker of inflam- 
mation, appears to be elevated in the blood of 
people who have COPD, but it may be more 
useful for predicting future risk of exacerbations 
than as a diagnostic tool. One NHLBI initiative, 
the Subpopulations and Intermediate Outcome 
Measures in COPD Study (SPIROMICS), aims 
to identify molecular markers of disease progres- 
sion. Such markers may enable doctors to “find 
[pulmonary] lesions extremely early,’ Kiley says. 

Besides developing better tools for diagnos- 
ing COPD, researchers are also trying to find 
ways to target screening efforts. Mass spirom- 
etry screening doesn't seem to bea good option. 
In 2008, the US Preventive Services Task Force 
found that health workers would need to screen 
roughly 450 adults between the ages of 60 and 
69 to pick out a single person who might later 


Damaged bronchi lead to a build-up of mucus 
and swollen bronchioles (orange). 


develop symptoms of COPD severe enough to 
require a trip to the emergency department. 
Mannino and Fernando Martinez, a lung 
specialist at the University of Michigan in Ann 
Arbor, were recently awarded a multimillion dol- 
lar grant by the National Institutes of Health to 
develop a more targeted method of diagnosis that 
could be used for screening programmes. They 
plan to combine a series of five simple questions 
on COPD risk factors — for example smoking 
status, wheezing anda chronic cough — with an 
easy-to-use, hand-held device that costs about 
US$30 and measures peak flow (the fastest 
speed at which an individual can breathe out). 
Spirometers, by comparison, generally cost sev- 
eral thousand dollars each, and health workers 
need special training to learn how to use them 
correctly. Both factors may have led to the under- 
use of spirometers in developing nations (see 
‘Where there's smoke; page S18). Patients with a 


normal peak flow are unlikely to have clinically 
significant COPD. Patients who fare poorly on 
peak-flow tests could undergo a more thorough 
assessment, including spirometry testing. 

COPD occurs when the tiny sacs inside the 
lungs, called alveoli, become damaged or chroni- 
cally inflamed. This damage or inflammation 
leads to breathing troubles. Spirometers and 
other airflow-sensing devices give doctors infor- 
mation about airway obstruction, but such tests 
cannot directly assess damage to lung tissue, says 
Jan-Willem Lammers, head of the respiratory 
medicine department at the University Medical 
Centre Utrecht in the Netherlands. This dam- 
age can be seen in computed tomography (CT) 
scans. In fact, research conducted as part of the 
COPDGene study, which aims to identify com- 
mon genetic factors that may subtly predispose 
people to the disease, found that CT imaging 
can identify lung deterioration in an individual 
before considerable airway obstruction occurs. 
The guidelines don't recommend treating indi- 
viduals who are asymptomatic, but Walsh and 
others hope that therapies might one day be able 
to prevent symptoms in those who are found to 
have early stages of COPD-like lung damage. 

CT scans aren't likely to become a screening 
tool just for COPD though. “That's going to be 
way too expensive, and people are going to be 
worried about the exposure to radiation) Walsh 
says. However, joint guidelines issued in May 
2012 by the American College of Chest Phy- 
sicians and the American Society of Clinical 
Oncology recommend that former and current 
heavy smokers of 55 to 74 years of age have a CT 
scan each year to screen for lung cancer. Those 
same images could also reveal signs of COPD. 

Clearly, a diagnosis isn’t a cure. It is unclear 
whether detecting the disease early will actu- 
ally benefit many patients. Will it help them live 
longer or improve the quality of their lives over 
the long term? Research addressing these ques- 
tions is surprisingly scarce, says Mannino. 

But an earlier diagnosis might have changed 
things for the Walsh brothers. Today, John 
Walsh is hooked up to an oxygen tank when he 
sleeps. His lung function is one-third of what it 
should be for a man his age. His brother Fred 
Walsh needs oxygen around the clock, and he is 
waiting for a lung transplant. John cant help but 
wonder how their lives might have been different 
had they been correctly diagnosed sooner. If the 
men had known that their mother died of COPD 
— which they didn’t fully realize until they them- 
selves were diagnosed — would Fred have taken 
up smoking? Did Fred’s lifestyle choices make his 
disease more severe than that of his brother, who 
has never smoked? 

The Walsh brothers can't change their own 
past. John, however, hopes that organizations 
such as the COPD Foundation can change the 
future for others, and soon, he says: “?’m an 
impatient patient” m 


Cassandra Willyard is a freelance writer based 
in New York. 
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Vitamin D,, shown in its crystalline form, can be used as a dietary supplement and is being studied for its therapeutic effects in patients. 


Ni 


The vitamin D complex 


Many COPD patients are deficient in vitamin D, a condition that can lead to bone problems 
as well as difficulty breathing. Can dietary supplements be of help? 


BY THEA SINGER 


y sister, Candice Singer, was a smok- 
Me fiend. Hooked since age 12, she 

was up to two packs a day by the time 
she started university, eventually rolling her 
own cigarettes to save money. In 2010, at age 
50, she was diagnosed with chronic obstruc- 
tive pulmonary disease (COPD) and needed 
to wear an oxygen mask at night to regulate her 
breathing while she slept. 

When she finally quit smoking, seven 
months ago, her vitamin D level was an alarm- 
ingly low 8 nanograms of 25-hydroxyvita- 
min D (25-[OH]D) — the major circulating, 
though inactive, form of vitamin D and used 
to measure sufficiency — per milliliter ofblood 
serum. The Institute of Medicine (IOM), a 
non-profit group afilliated the US National 
Academy of Sciences, defines vitamin D 
sufficiency at over twice Candice’s level — 
20 ng/mL. And The Endocrine Society, an 
international group of endocrinologists, set an 
even higher sufficient range of 40-60 ng/mL. 


That's for good reason. Vitamin D deficiency 
can lead to osteoporosis and osteomalacia, a 
fragility anda softening of the bones. Candice 
came to know the symptoms, with pains in 
her feet and back. And new research suggests 
that patients like Candice have other things 
to worry about, as vitamin D deficiency may 
affect more than the skeleton — even the abil- 
ity to breathe. 


A COPD-VITAMIN D LINK? 

COPD is primarily an inflammatory disease. 
The predominating hypothesis holds that ciga- 
rette smoke damages the lung’s tissue, spark- 
ing an innate immune response. Immune cells, 
including macrophages and neutrophils, rush 
into the lungs to protect the cells lining the 
airways from the smoke, releasing reactive, 
oxygen-containing molecules along the way. 
Antimicrobial peptides, a group of molecules 
that damage and kill microorganisms, join the 
fray, as do pro-inflammatory T cells stimu- 
lating the production of antibodies of as yet 
unknown specificity. 
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In smokers, this proinflammatory response 
is relentless. In the emphysema form of 
COPD, holes appear in the alveoli — the tiny, 
balloon-like structures in the lungs where 
oxygen and carbon dioxide are exchanged. 
And in response to noxious stimuli, the 
smooth muscle beneath the epithelial and 
connective layers of tissue contracts and 
expresses adhesion molecules, cytokines, 
chemokines and growth factors’. Over the 
years, this muscle thickens. “Think of it as 
weight lifting,” says Reynold Panettieri Jr, 
professor of medicine at the University of 
Pennsylvania in Philadelphia. “If you keep 
lifting weights, your muscle bulks up — it 
gets thicker. And the problem with thicker 
is that the [airway becomes narrower] simply 
by the increased mass of the muscle.” Hence 
the shortness of breath that characterizes 
COPD. Most of the inhalant corticosteroids 
and some other medications for COPD aim 
to relax that stiffened muscle. 

This is where vitamin D comes in. Nearly 
every cell in the body has a surface-bound 
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receptor that directs vitamin D to the nucleus. 
In fact, about 3% of the human genome is 
regulated by the active form of vitamin D, 
1,25-dihydroxyvitamin D*. Studies of human 
cells in culture, have shown that vitamin D 
stunts the growth of human airway smooth 
muscle cells®. This effect, says Panettieri, is 
even more pronounced than that induced by 
inhaled steroids. Moreover, during an infection 
—acommon occurrence in COPD patients 
given their compromised lungs — vitamin D 
aggravates the misfiring immune response. 
Studies have also found a link between 
vitamin Ddeficiency and autoimmune diseases 
including multiple sclerosis and rheumatoid 
arthritis’**, 


SEARCHING FOR PROOF 

It is not known whether vitamin D deficiency 
can cause COPD, but there’s evidence that 
it may be involved in its pathogenesis. One 
epidemiological study found vitamin D defi- 
ciency in more that 60% of patients with severe 
COPD, and the more severe the disease, the 
worse the deficiency’. Research presented in 
May 2012 at the American Thoracic Society's 
annual meeting found that a three-year decline 
ina crucial metric of breathing, forced expira- 
tory volume (FEV 1), was linked to vitamin D 
deficiency. The decline was so steep it is com- 
parable to the effects of smoking®. Anda 2011 
study in mice found that vitamin D deficiency 
causes deficits in lung function’. 

So if vitamin D levels correlate with the sever- 
ity of COPD, could a dietary supplement of the 
vitamin — which the body synthesizes from 
the sun’s ultraviolet-B (UVB) — help abate the 
debilitating condition? There are few studies 
that have given vitamin D to COPD patients. 

A randomized, double-blind placebo- 
controlled study, the gold standard of clinical 
trials, led by pulmonologist Wim Janssens at 
University Hospitals Leuven, Belgium, had 
mixed results. Patients 50 years of age or older 
with moderate to very severe COPD were given 
large doses of vitamin D ora placebo at 4-week 
intervals for a year. The researchers then 
recorded the incidence of COPD exacerbations 
— a worsening of respiratory symptoms over 
48 hours — and monitored blood serum levels 
of 25-[OH]D. 

In the supplemented group, mean levels 
of 25-[OH]D increased (to 52 ng/mL) com- 
pared to the placebo group. But there was no 
improvement in the timing and frequency of 
exacerbations, FEV 1, or rates of hospitalization 
and fatality®. “The main message of the paper is 
anull message,’ says Janssens. 

Despite discouraging results, Janssens 
remains convinced that vitamin D deficiency 

plays a role in COPD 
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trial’s data and found that those with the most 
severe deficiency — 30 patients with baseline 
serum 25-(OH)D levels below 10 ng/mL — had 
43% fewer exacerbations over the year. 

That analysis matters, says Panettieri, not- 
ing the treatment’s potential efficacy in certain 
subgroups. He explains that vitamin D, in high 
enough doses, can act as an anti-inflammatory. 
But, he says, the vita- 
min D levels induced 
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Former smoker Candice Singer now lives with COPD. 


use vitamin D as an anti-inflammatory,” he 
says. “We believe that if we used vitamin D in 
a more challenging approach to enhance the 
anti-inflammatory effects, maybe the study 
would have been positive, although more data 
is necessary to prove that.” 

Diane R. Gold and JoAnn E. Manson, pro- 
fessors of medicine at Harvard Medical School 
in Boston, Massachusetts, addressed the issue 
of vitamin D dosing in an editorial accompa- 
nying publication of the clinical trial. They 
cited the benefits of daily versus intermittent 
dosing in other, non-COPD-related vitamin D 
trials’. They also suggest that differences in 
participants’ physiology, immunology, and 
genetics may partly explain the null results in 
the general COPD population. COPD is not a 
monolithic disease: muscles thicken and tissues 
are destroyed to varying degrees, and molecular 
variants in the vitamin D binding protein — 
which transports vitamin D in the bloodstream 
— affect activation of immune cells including 
macrophages. In fact, Janssens had found that 
COPD patients with two copies of a particular 


variant in the gene for vitamin D binding pro- 
tein were more likely to be deficient in vitamin 
D deficiency’. 

Unraveling whether vitamin D deficiency 
is a cause of COPD requires more clinical 
data. Gold and Manson are leading one of the 
most extensive of those efforts. Their National 
Institutes of Health-funded Vitamin D and 
Omega-3 Trial (VITAL) is just starting; it will 
follow 20,000 participants age 50 and older as 
they receive 2,000 international units (IU) of 
vitamin D daily for five years. 


DEFICIENCY AS CAUSE? 

At least one study in mice sheds some light on 
the relationship between COPD and vitamin D. 
A team led by Graeme Zosky, who heads the Lung 
Growth and Respiratory Environmental Health at 
the Telethon Institute for Child Health Research, 
in Subiaco, Australia, fed one group of female 
mice a vitamin D-deficient diet and another 
group a diet with sufficient levels. They studied 
the offspring of both, looking for differences in 
lung physiology. They measured lung volume 
and dissected the lungs to take a look inside. 

The offspring of the vitamin D deficient 
mice — also deficient in vitamin D — had 
statistically significantly smaller lungs than 
normal: about 18% smaller in female offspring 
and 28% smaller in males’. “Our point was 
that vitamin D deficiency isn't causing gross 
structural changes in the lung, but it might be 
slowing down lung growth,” Zosky says. “What 
we don't know is what that means when you 
put disease on top of it. My suspicion is, if we 
introduced COPD pathology in these mice, the 
effect would be much larger” 

My sister Candice, for one, believes her lung 
capacity has increased since she started talk- 
ing vitamin D as part of a smoking cessation 
program. She began with a daily dose of 10,000 
[Us for four weeks, and continued with a daily 
dose of 5,000 IU (The IOM recommends just 
600 IU a day for adults up to age 70 and 800 IU 
a day for those 71 and older). Candice can't 
separate out the benefits of supplementation 
from those of not smoking, but she attributes 
her improved health to both. She has little 
trouble carrying her four-year-old daughter 
up the stairs now — whereas before, she says, 
“I couldn't do that without losing my breath” 


Thea Singer is a freelance writer in Brookline, 
Massachusetts. 
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Cadaveric or animal lungs might form the framework for new lungs made of a patient’s own cells. 


Artificial 


inspiration 


The donor lung's of the future — built from collagen or silicone 
rubber or engineered from donor organs stripped of 
their original cells — might give a new lease of life to 


COPD patients. 


BY ELIE DOLGIN 
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cent human lungs. “You guys are so cool,” 

she says, peering into a keg-sized glass 
bioreactor at Massachusetts General Hospi- 
tal (MGH) in Boston. The lungs float inside, 
puffing up as 40 litres of soapy fluid is pumped. 
through their network of blood vessels. Less 
than a week earlier, these lungs belonged to a 
47-year-old New Yorker, allowing him to take 
one last breath before dying of cardiac arrest. 
Now, his donated lungs have been stripped of 
their cells and genetic material. All that remains 
are two cone-shaped objects made of collagen 
and other structural proteins, dangling as 
though in suspended animation. 

Gilpin, a postdoctoral fellow, is using such 
protein scaffolds to try to grow a set of bio- 
artificial lungs in the laboratory. Together 
with Harald Ott, a cardiothoracic surgeon at 
MGH, Gilpin is devising a protocol to flush 
out the cells from donated lungs that are 
ineligible for direct transplantation. Even- 
tually, they plan to seed the support struc- 
ture with stem cells and to direct these cells 
to develop, or differentiate, into the vari- 
ous cell types in the lung, thereby rebuilding 
functioning tissue. Using the techniques the 
lab has developed so far, Ott has already engi- 
neered bioartificial rat lungs that can perform 
the appropriate gas-exchange functions when 
transplanted back into living rats'. Now, he and 
Gilpin are scaling up to humans. 

Success in their quest would bring good news 
to the estimated 65 million people worldwide 
who live with chronic obstructive pulmonary 
disease (COPD), a lung disorder mainly caused 
by exposure to tobacco smoke and airborne 
pollutants. Medication can often help treat the 
symptoms and complications of COPD, such 
as breathlessness and chronic coughing. But, 
for many people with severe forms of the dis- 
ease, the only option is to replace a bad set of 
lungs with a good set. Relatively few lungs are 
donated, however, and most of these are dam- 
aged and therefore unsuitable for transplanta- 
tion (see ‘Organ refit’). So, each year, around 
the world, only a few thousand patients with 
COPD receive a life-saving transplant. New 
sources of healthy lungs or devices that serve 
the same purpose are desperately needed. 

Such technologies are lacking. “Count the 
number of devices you can use if your heart is 
failing, and you can make a nice long list. But if 
your lung is failing, you have very few options,’ 
says Scott Merz, president and co-founder of 
Michigan Critical Care Consultants in Ann 
Arbor, a company that designs and develops 
‘lung-support’ technologies. 

Lab-grown lungs, reconstructed from dam- 
aged donor lungs and seeded with a patient's 
own cells, offer one attrac- 
tive option. What’s more, DNATURE.COM 
these bioartificial trans- Artificial pancreas 
plants could havea major _ tomake diabetes 
advantage over their moremanageable: 
natural counterparts: by __ go.tlature.com/spn6zs 
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incorporating a patient's own (autologous) 
cells, the lungs would be less prone to rejection 
by the immune system than lungs transplanted 
directly from a donor, and recipients wouldn't 
have to take lifelong cocktails of immunosup- 
pressive agents. However, considering that 
Gilpin, on this particular July morning, was 
only part way through removing the cells 
from her second set of human lungs and that 
no other team has reported success with this 
approach ona similar scale, the research is still 
along way from the clinic. 

“We're a couple of decades away from hav- 
ing an autologous, tissue-engineered lung,’ says 
Laura Niklason, a biomedical engineer at Yale 
University in New Haven, Connecticut, who 
has independently created bioartificial rodent 
lungs’. “There are technological hurdles, and 
there are stem-cell differentiation hurdles. But 
I don't think anything is insurmountable.” 


Surgeons strip lungs of their original cells. 


In the meantime, many academic and 
industry researchers are turning to mechani- 
cal devices that can help the lungs to carry out 
their main function: gas exchange, specifi- 
cally the removal of carbon dioxide from the 
bloodstream and the delivery of oxygen. The 
first such products are already on the market, 
and the next generation of ‘artificial lungs’ is 
not far behind. 


OUTSIDE THE BOX 

Novalung, a device manufacturer based in 
Talheim, Germany and spun off from Merz’s 
company, is the market leader in lung-support 
technologies. The company’s interventional 
lung assist (iLA) membrane ventilator takes 
blood from a patient’s thigh and circulates 
it under ambient pressure from the femoral 


artery through a rod lined with hundreds of 
tubes full of pure oxygen. Oxygenated blood 
then flows back into the femoral vein. This 
system, which was approved in Europe and 
a handful of other countries in the past ten 
years for providing respiratory support for up 
to 29 days, has mainly been used as a stopgap 
for people urgently awaiting lung transplants. 
But doctors are beginning to test whether this 
device, and similar technologies, can be used 
earlier in the treatment process to avoid the 
need for transplants altogether. 

Long before needing a new set of lungs, a 
patient with COPD will be well acquainted with 
the hospital emergency department because of 
frequent episodes of severe breathing difficulty. 
These attacks — called acute exacerbations — 
occur, on average, two to three times a year in 
people with moderate-to-severe COPD. To give 
the lungs a break and help the patient breathe 
during one of these episodes, doctors usually 
attempt to deliver air through a face mask. 
Although these masks are easier to apply and 
less cumbersome for patients to wear than the 
iron lungs of the early twentieth century, mask- 
based ventilation doesn’t work for everybody. 
For example, individuals who are not fully con- 
scious and cooperative are not considered good 
candidates. So, if the mask approach fails, doc- 
tors turn to more invasive techniques, in which 
a breathing tube is inserted into the windpipe 
and a ventilating machine pushes air directly 
into the lungs. Such mechanical ventilation can 
save lives during a COPD attack. But it also sig- 
nificantly increases the risk of pneumonia, and 
forcing the air into the lungs can cause injury. 

Lung support devices such as the iLA might 
obviate the need for invasive ventilation dur- 
ing exacerbations. For example, in a pilot study 
of the iLA system reported in June 2012 at the 
annual meeting of the German Society for 
Internal Intensive and Emergency Medicine 
in Cologne, doctors at the University Medical 
Center Hamburg-Eppendorf in Germany 
helped 13 of 14 patients with COPD avoid 
tracheal intubation. 

“You have something that breathes outside 
the patient, allowing the lungs to recover and 
heal rather than being forced to perform by 
mechanical ventilation,’ says Georg Matheis, 
a managing director of Novalung, based in 
Heilbronn, Germany. “This might affect the 
long-term cause of COPD by mitigating or 
avoiding the damage inflicted by these exac- 
erbations.” In this way, patients who are pro- 
tected against progressive damage by a system 
such as Novalung’s might ultimately live longer 
or have an easier time breathing. 

Novalung is now moving towards a new 
iLA design, with a pump that exchanges 
blood through the jugular veins, in the neck. 
This approach gives patients greater mobility, 
as well as lowers the risk of heart attack and 
other problems associated with an arterial 
puncture. But Novalung is not alone in pur- 
suing this strategy: another company, ALung 


ORGAN REFIT 
All donations accepted 


More than 80% of donated lungs worldwide 
are too damaged to transplant. These 

lungs can still be used to study tissue- 
engineering techniques, such as those 

used for generating bioartificial lungs in the 
laboratory of Harald Ott, a cardiothoracic 
surgeon at Massachusetts General Hospital 
in Boston. But, with many people dying 
while on the waiting list for a lung transplant, 
researchers are beginning to explore 
whether the unsuitable lungs might be of 
immediate benefit to patients, especially if 
some of the tissue damage can be repaired. 

Ina 2012 study, a British team of doctors 
assessed the survival of nearly 1,300 lung- 
transplant recipients, some 500 of whom had 
received lungs from donors who smoked®, 
which is permitted under UK policy. As might 
be expected, transplant recipients whose 
lung donors had not smoked survived longer. 
But recipients of smokers’ lungs fared better 
than patients with COPD who remained on 
the waiting list. “It’s better to get a lung with 
a smoking history than no lung at all,” says 
Lorraine Ware, a doctor who studies lung 
transplantation at the Vanderbilt University 
School of Medicine in Nashville, Tennessee. 

However, before doctors race to 
transplant suboptimal lungs from smokers, 
they might consider applying a recently 
developed technique called the Toronto 
XVIVO Lung Perfusion System. Developed 
by Shaf Keshavjee and his colleagues at 
the University Health Network in Toronto, 
Canada, the procedure involves pumping a 
protein and nutrient solution into the donor 
lungs and ventilating them with an oxygen 
gas mixture for several hours to repair 
damage caused by swelling or inflammation. 

Ina 2011 report, the Toronto team showed 
that only 15% of suboptimal lungs prepared 
in this way showed signs of acute organ injury 
three days after transplantation, in contrast 
to 30% of the supposedly healthy lungs that 
were transplanted without such preparation®. 
“The advantage isn’t just that we can use 
more lungs but that we also make these 
transplants safer and more predictable,” says 
Keshavjee, adding that around 20% of organs 
transplanted at the Toronto Lung Transplant 
Program undergo this perfusion treatment. 
(The Toronto program, like those in the UK, 
allows the use of smokers’ lungs, which 
Keshavjee says make up 30-40% of lungs 
considered for transplant.) 

“This is our first foray into partially 
engineering or repairing donor lungs,” 
Keshavjee notes. If he succeeds more 
broadly, maybe bioartificial lungs won’t be 
necessary after all. E.D. 
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Technologies based in Pittsburgh, Pennsylva- 
nia, is advancing a similar device as an alterna- 
tive to mechanical ventilation. 

In June 2012, at the 58th Annual Confer- 
ence of the American Society of Artificial 
Internal Organs in San Francisco, California, 
ALung reported success with its Hemolung 
device, showing that patients with severe 
exacerbations of COPD experienced, on aver- 
age, 28% reductions in their blood levels of 
carbon dioxide within 24 hours of the start 
of Hemolung treatment. “We're bringing the 
technology down to a level of safety, simplicity 
and superior efficacy, all with a lower level of 
invasiveness,” says Scott Morley, a biomedical 
engineer who heads product management at 
ALung. The company plans to file for Euro- 
pean approval of the Hemolung system later 
this year. 


POOR EXCHANGE 

Not everyone is convinced that lung-support 
devices offer any real advantage over mechani- 
cal ventilation for treating COPD. “Right now, 
the risk—benefit ratio of those two is very close, 
and it’s hard to tell how that’s going to play out,’ 
says Joseph Zwischenberger, a cardiothoracic 
critical-care doctor at the University of Ken- 
tucky in Lexington, who has been working on 
artificial lungs for about 30 years. “There are 
those who think the technology has evolved 
to the point where the risk of the devices is less 
than the risk of being ona ventilator, and then 
there are a large majority who say, “Nope, it’s 
just not there yet.” 

One of the biggest problems with the exist- 
ing technologies is their tendency to clot: rapid 
blood cells encounter something akin to a traf- 
fic jam as they flow into and out of the devices. 
The devices are also bulky, and only limited gas 
exchange happens as the blood passes over the 
oxygen-filled fibres. Thus, many advocates of 
lung-support systems are looking to completely 
redesign the technology and are turning to the 
science of very small volumes — microfluidics. 

Using microfluidics “enables you to get 
much more surface area for gas exchange 
per unit blood volume than the traditional 
devices,” says William Federspiel, a bioen- 
gineer at the University of Pittsburgh and a 
co-founder of ALung. “That could allow for 
a more compact device that is potentially effi- 
cient enough to run off room air,’ unlike exist- 
ing devices, which typically require a tank of 
pure oxygen. 

Some success has been achieved already. 
Air delivered at near ambient pressures was 
able to oxygenate blood in a microfluidic 
device designed by 
Joseph Potkay and 


Only apair his colleagues at Case 
of healthy Western Reserve Uni- 
human lungs versity in Cleveland, 
can reverse Ohio. In a proof-of- 
the downward principle test pub- 
progression lished in 2011 (ref. 3), 


A lung-on-a-chip (top), a lung-support machine 
(middle) and a bioreactor containing a rat lung 
removed of cells and ready for regeneration. 


Potkay and his team created a credit-card-sized 
device with artificial capillaries that achieved 
gas-exchange rates far surpassing those of cur- 
rent devices, without the need for pure oxygen. 
“We're pushing the size limits of this technol- 
ogy, says Potkay, who is now at the Veterans 
Affairs Medical Center in Ann Arbor. But 
“through [such] microfabrication, we can get 
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devices on the same size scale as the structures 
in the natural lung” 


SIZE MATTERS 

The device still has some kinks that need to be 
worked out. For one thing, it would take more 
than 1,000 of them on top of one another to pro- 
vide just one-quarter of the body’s baseline gas- 
exchange support. And Potkay’s team designed 
the device to maximize gas exchange but didn't 
optimize it to prevent clotting. By more closely 
mimicking natural blood-vessel architecture, 
however, a few research teams have designed 
microfluidic chips that circumvent the clotting 
problem, paving the way for long-term lung- 
support devices that could aid even mobile 
patients with COPD. 

Joseph Vacanti, a transplant surgeon at 
MGH, is leading one such effort. Vacanti helped 
Ott to engineer bioartificial rat lungs, but he 
is now focused on developing a microfluidic 
device. Vacanti’s group has built a device with a 
bifurcating architecture that closely matches the 
branching of blood vessels in the body*. With 
an eye to miniaturization, his group is now 
working to make the system out of collagen 
instead of silicone, a material that introduces 
much dead space into the device. “We've got 
a lot of work to do on choosing the best mate- 
rial and making it in the thinnest possible way 
so the size of the thing is either wearable or 
implantable,’ says Vacanti. 

Across the river from MGH, Jeffrey Boren- 
stein, director of the Biomedical Engineering 
Center at the Charles Stark Draper Laboratory in 
Cambridge, who has his own microfluidic lung 
prototype, is focused on optimizing another part 
of the design. To reduce the possibility of clot- 
ting, Borenstein and his colleagues are seeding 
the channels of their branching gas-exchange 
device with the cells that line blood vessels — 
endothelial cells — and, so far, they have seen no 
decline in the rates of gas transfer. “As we think 
about clinical implementation we really want to 
make this as natural a device as possible so were 
not always dealing with these blood interactions 
with synthetic surfaces,’ Borenstein says. 

Gilpin, the MGH postdoc, praises the efforts 
to develop lung-support devices and hopes they 
will soon provide some relief for people with 
COPD. But, ultimately, she says, only a pair of 
healthy human lungs can reverse the down- 
ward progression of this devastating condition. 
She leans her elbows on the lab bench, props her 
face in her hands, and gazes at the stripped set of 
lungs. “Seeing this makes me think we have the 
platform; she says. “Youd never be able to create 
this. Only nature can create this.” m 


Elie Dolgin is a news editor at Nature Medicine. 
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enigmatic traits. A progressive respiratory disorder that causes 

severe breathing difficulties, it can worsen even after patients 
stop smoking, may flare up unpredictably without apparent cause, and 
is often associated with other medical conditions such as atherosclero- 
sis, osteoporosis, renal dysfunction and cancer. Many characteristics 
of COPD could be attributable to pathologic autoimmune processes’. 
And there are increasing numbers of reports that autoantibodies 
against many different lung proteins are often present in COPD 
patients” If the underlying cause of COPD were indeed autoimmun- 
ity, new treatments might improve quality of life for patients. 

Nonetheless, scepticism about the role of autoimmunity in COPD 
still abounds and is probably justified, albeit with several qualifications. 

Some of the uncertainty may be explained by the nuanced and com- 
plex biology of autoimmunity*. Low-level reactivity to most (or maybe 
even all) of the body’s own proteins is common, and is probably criti- 
cal in preventing the immune system from mistakenly launching an 
injurious inflammatory response against our own cells and tissues. But 
in several studies, patients diagnosed with certain diseases, including 
COPD, have shown significantly higher 
levels of one or more antibodies that bind 
to the body’s own proteins, compared to 
demographically matched healthy people. 
These findings are generally considered 
evidence of autoimmunity. 

Several investigators have used varia- 
tions of this approach to identify a “COPD 
autoimmune response.” The increasing 
availability of high-throughput antigen 
array chips that can detect the presence of 
autoantibodies against thousands of pro- 
teins simultaneously, and the ease by which antigen-specific autoanti- 
bodies can be discovered, may soon spur a profusion of similar studies. 

However, not all of these autoantibodies should be considered patho- 
genic, even if their presence is abnormal. The problem lies in under- 
standing whether a particular one causes or contributes to disease. 

Autoimmunity often develops as a consequence of chronic inflam- 
mation caused by distinct disease processes’. In all but a few cases, it 
is unknown how or why initial, narrowly targeted, and appropriate 
immune responses become misdirected to attack normal tissues and 
cause disease. One plausible explanation goes that in COPD patients, 
host defences triggered by microbial infection of the airways promote the 
development of autoimmune responses’. The tobacco smoke that is the 
number-one risk factor for COPD in industrialized societies is a complex 
mix of highly reactive chemicals that can modify native proteins. These 
modified proteins may no longer be recognized as ‘self’ by the immune 
system, and can be targeted by an immune response’. 

Most autoantibodies have no apparent pathogenicity. A few, how- 
ever, are profoundly pathogenic and cause severe tissue damage’. 
The trick for researchers is to determine whether observed autoim- 
mune responses are actually pathogenic, or simply abnormal and 
harmless. Many studies have found links between autoantibodies 
and the severity of COPD. That's a good start. But because underlying 


Cen: obstructive pulmonary disease (COPD) has many 


NOT ALL OF THESE ANTIBODIES 
SHOULD BE CONSIDERED 
PATHOGENIC, EVEN IF THEIR 
PRESENCE IS ABNORMAL 


Clues, not conclusions 


Scientists have some way to go before they can prove that COPD 
should be treated as an autoimmune disease, says Steven R. Duncan. 


inflammation could be proportionally driving harmless (epiphenome- 
nal) autoimmune responses, it is a less than wholly compelling explana- 
tion. Some researchers maintain that particular COPD autoantibodies 
can be meaningful only if they are not associated with other diseases. 
In reality, however, fundamental immunologic processes are often 
shared among very different clinical syndromes. Hence, the presence of 
a superficially similar autoantibody in two or more clinical syndromes 
says little about the pathogenicity in any one of those diseases. Someday 
someone might discover the autoantibody that is the exclusive cause of 
COPD, and then the respective assay will be absolutely sensitive and 
specific for this disease. For now, though, this seems unlikely. 

In the meantime, establishing the contribution of any given auto- 
immune response to COPD requires, at the least, a demonstration 
of pathogenicity. Autoantibodies can be isolated from patients and 
tested for their effects on human cell targets in vitro’. Ifauto-immune 
responses are truly pathogenic, there should also be clear evidence of 
characteristic injuries within the target organ”. T cells have exquisite 
specificity for antigens and considerable potential pathogenicity, and 
are inert to self-antigens in healthy patients. Accordingly, the presence 
of autoreactive T-cells is highly abnormal 
and is strong evidence for the presence of 
a pathogenic autoimmune process. 

So far, however, few mechanistic stud- 
ies show specific COPD autoimmune 
responses that cause injury. Proof that a 
particular autoimmune response contrib- 
utes to COPD will probably require recreat- 
ing the disease in an animal model through 
adoptive transfers of patient-derived 
pathogenic autoantibodies or lymphocytes 
— a task that may prove difficult because 
of key differences between the immune systems of mice and humans. 

When, as many of us working on COPD anticipate, autoimmune 
mechanisms are more convincingly demonstrated to be factors in 
COPD pathogenesis, clinical trials will have the potential to yield 
unprecedented medical advances. Currently available treatments for 
COPD primarily ameliorate symptoms. Some treatments do attack 
the underlying inflammation, but not very effectively, and they do not 
dramatically alter the overall progression of the disease. But modali- 
ties to remove autoantibodies or minimize their subsequent produc- 
tion are already available, and several more agents are now being 
developed that could also work*. We may soon have the rationale and 
tools to implement novel, and potentially better, therapies for this 
extremely morbid and otherwise unremitting syndrome. = 


Steven R. Duncan is a pulmonary and critical care medicine 
physician at the University of Pittsburgh in Pennsylvania. 
email: duncsr@upmc.edu 


. Agusti, A. et al. Thorax 58, 832-834 (2003). 

. Feghali-Bostwick, C. A. et al. Am. J. Resp. Crit. Care Med. 177, 156-163 (2008). 
. Kirkham, P.A.Am. J. Respir. Crit. Care Med. 184, 796-802 (2011). 

. Marrack, P. et al. Nature Immunol. 7,899-905 (2001). 

. Cancro, M. P. etal. J. Clin. Invest. 119, 1066-1073 (2009). 


ORWNHE 


27 SEPTEMBER 2012 | VOL 489 | NATURE | S15 


Drugs to treat COPD, such as Spiriva, aim to relax the smooth muscle of the airways. 


THERAPEUTICS 


Strength in 


numbers 


Several new drugs for treating chronic obstructive 
pulmonary disease are about to hit the market, 


with more in the pipeline. 


BY DUNCAN GRAHAM-ROWE 


chest, wheezing and that desperate feeling 

that you can't get enough air into your 
lungs: these are all familiar to asthma sufferers. 
For people with chronic obstructive pulmonary 
disease (COPD), however, such symptoms are 
not fleeting. They are constant and aggra- 
vated by a steady build-up of sputum, shallow 
breathing anda persistent cough. The most that 
patients can hope for in terms of treatment is 
merely to manage their symptoms as they 
steadily worsen over time. 

COPD, unlike asthma, is associated with 
tobacco smoking and long-term exposure to 
airborne toxicants, which irreparably damage 
the lungs, causing inflammation and airway 


Sa of breath, a tightening of the 


narrowing. But traditionally, COPD has been 
treated in much the same way as asthma. For 
relief, patients can inhale short-acting broncho- 
dilators, which relax the airways: for example, 
salbutamol, which is marketed as Ventolin by 
the London-based drug giant GlaxoSmith- 
Kline (GSK). Inhaled corticosteroids also 
reduce inflammation in the lungs and there- 
fore decrease the risk of acute episodes known 
as exacerbations, which can leave patients hos- 
pitalized for days. For managing symptoms 
over the longer term, there are bronchodilator- 
corticosteroid combinations such as salmeterol- 
fluticasone propionate (Advair; also marketed 
by GSK), which can ease symptoms for up to 12 
hours. Another long-term option is tiotropium 
bromide (Spiriva), which acts by a different 
bronchodilation mechanism to provide 24-hour 
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relief, and is marketed by pharmaceutical com- 
panies Boehringer Ingelheim, based in Ger- 
many, and Pfizer, based in Groton, Connecticut. 

Now, several new drugs for COPD treatment 
are on the verge of being approved by regula- 
tory authorities (see ‘Long-acting, daily medi- 
cations). The new candidates are not cures, but 
they may offer genuine hope that COPD suffer- 
ers can have a better quality of life. 


FAST ACTION 

On the surface, the offerings look like sim- 
ple replacements for Spiriva and Advair. The 
closest to market is glycopyrronium bromide 
(NVA237; Seebri), which is awaiting approval 
in Europe and has been developed by the phar- 
maceutical giant Novartis, headquartered in 
Basel, Switzerland. Like Spiriva, this once-daily 
drug acts on a set of nerves in the smooth mus- 
cle around the airways. “Acetyl-choline acts on 
muscarinic receptors found in the muscles sur- 
rounding the airways, causing the muscle in the 
airways to contract and the airways to narrow,” 
explains Dave Morris, who heads global devel- 
opment for primary care at Novartis. In patients 
with COPD, the new drug competes with ace- 
tylcholine, blocking the receptors and prevent- 
ing the airways from closing up. It is therefore 
classified, together with Spiriva, as a long-acting 
muscarinic antagonist (LAMA). 

Novartis has carried out three large-scale 
phase III trials of Seebri. In May 2012, it reported 
the results of the second trial, GLOW2, at the 
American Thoracic Society International Con- 
ference in San Francisco, California. GLOW2 
was conducted over one year and involved 1,066 
individuals: Seebri not only improved lung func- 
tion to the same extent as its analogue, Spiriva, 
but also took effect quicker. At both 5 minutes 
and 15 minutes after inhalation, it produced a 
doubling in FEV,, the amount of air that can 
be exhaled in 1 second (as measured by blow- 
ing into a spirometer). It has also been clinically 
shown to produce greater bronchodilation in the 
first 4 hours of use than Spiriva. 

GSK is taking a different tack. In July, 
together with Theravance, a biopharmaceutical 
company based in South San Francisco, Cali- 
fornia, it applied for regulatory approval in both 
Europe and the United States for a new drug 
combination: vilanterol—fluticasone furoate 
(previously known as Relovair but now called 
Relvar or Breo). Relvar is similar to Advair 
in that it consists of a long-acting B2-agonist 
(LABA) bronchodilator and an inhaled cor- 
ticosteroid. Like Seebri (and Spiriva), Relvar 
works on the nerves in the smooth muscles, but 
it does so by stimulating §2-adrenergic recep- 
tors, which triggers a biochemical cascade that 
leads to the relaxation of the smooth muscle in 
the airways. In phase III trials, Relvar improved 
lung function compared with placebo or Advair 
when taken over a 12-week period, as measured 
by standard spirometry tests. 

GSK has not yet published numbers showing 
the extent of the improvement. But, according 
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LONG-ACTING, DAILY MEDICATIONS 


Do combination therapies really add value to COPD treatments? The pros and cons of long acting therapies in the pipeline. 


Name Main benefit Type of drug Active components Delivery method 

Advair Improves lung function Long-acting B2-agonist | Fluticasone/ Dry powder 
for a period of time (LABA) and corticos- salmeterol inhaler 

teroid 

Spiriva Improves lung function | Long-acting muscarinic | Tiotropium bromide Dry powder 
for a period of time antagonist (LAMA) inhaler 

PTOO3 Efficient delivery, LAMA + LABA Glycopyrrolate and Metered dose 
improved lung function | (two molecules) formoterol inhaler (MDI) 

(LAMA + LABA) 

NVA237 | Improves breathing LAMA Glycopyrronium Dry powder 
in a matter of minutes bromide (LAMA) inhaler 

Relvar Once daily instead LABA + inhaled Vilanterol and Dry powder 
of twice corticosteroid fluticasone furoate inhaler 

MABA Dual action molecule LAMA + LABA MABA Most likely dry 
improves lung function powder inhaler 


to a spokesperson, Relvar’s effects last longer 
than those of Advair, so it need only be taken 
once a day rather than twice. A once-daily 
dose should significantly increase compliance. 
“Patients frequently don't take their second 
dose,’ says James Donohue, a pulmonary dis- 
eases specialist at the University of North Caro- 
lina at Chapel Hill, who has been involved in 
trials of Spiriva and has worked as a consultant 
to both GSK and Novartis. 


DYNAMIC DUOS 

Patient compliance concerns are also driv- 
ing the development of combination thera- 
pies — the rationale being that patients find it 
easier to take a single medication. Combination 
therapies also have another advantage. Inhaled 
corticosteroids don't seem to have strong anti- 
inflammatory effects when they are used in iso- 
lation. When used in conjunction with separate 
bronchodilators, however, they've been shown 
to reduce the frequency of exacerbations. This 
synergistic effect probably arises because “the 
6-agonist facilitates the entry of the steroid into 
the [cell's] nucleus’, Donohue says. So delivering 
them as part of a combination therapy has the 
potential to optimise this synergy. 

Another effect with ramifications for treat- 
ing COPD involves the simultaneous use of a 
LAMA anda LABA. The idea is to open the air- 
ways further by simultaneously switching off the 
nerves that tighten the passages while stimulat- 
ing the ones that relax them, says Chris Cates, 
a population health researcher at St George's 
University of London who studies COPD. 

Such LAMA-LABA combinations are being 
developed by a company called Pearl Therapeu- 
tics, based in Redwood City, California. Pearl has 
already carried out eight clinical trials on a com- 
bination therapy called PT003, and phase III tri- 
als are set to commence in the first half of 2013, 
says Colin Reisner, Pearl’s chief medical officer. 

PT003 is notable not so much for its chemistry 


as for its novel method of delivery, says Donald 
Tashkin, medical director of the Pulmonary 
Function Laboratory at the David Geffen School 
of Medicine, University of California, Los Ange- 
les, who has consulted for the company. Both the 
LAMA and LABA components — glycopyrro- 
late and formoterol fumarate, respectively — 
are established drugs. But PT003 delivers them 
together, bound to particles with a diameter of 
less than 5 micrometers. In this way, the drug 
molecules are suspended in solution and can 
be dispensed through a metered dose inhaler, a 
device that many patients find easier to operate 
than the commonly used dry powder inhalers. 
The molecular vehicle also provides an efficient 
means of delivery, ensuring that around half of 
the drug reaches its target, as opposed to as little 
as 5% with some delivery systems, says Tashkin. 

Synergistic effects can occur in the lungs even 
when LABAs and LAMAsare inhaled separately, 
but one after the other. However there is preclini- 
cal evidence that the approach of administering 
them simultaneously 
through a single 


Doses Adverse effects Stage 
per day 
2 Increased risk of Available 
non-fatal pneumonia 
1 Hives, rash, swelling Available 
and dry mouth 
2 Headache, dry mouth | Phase Il com- 
and coughing pleted and Phase 
Ill to begin 2013 
il Headache, dry mouth | Phase Ill 
and coughing completed and 
approval sought 
in Europe 
1 Fatal pneumonia Phase III 
reported completed and 
approval sought in 
USA and Europe 
Unknown | Unknown Phase II 


bronchodilator by itself? says Baker. “However, 
we also see the opportunity for a triple-mecha- 
nism action with an inhaled corticosteroid” 

Ultimately, this is where the greatest value 
may lie — not in the pharmacological precision 
ofa single molecule with a dual mechanism of 
action, but rather in the ease with which it could 
pass through the drug regulatory system. “The 
beauty of this is when it is approved, MABA 
would count as one drug,” says Baker. 

In principle, companies hoping to develop 
a triple therapy by adding one drug to this 
MABA would therefore have a low regulatory 
bar, says Tashkin. Normally, triple combina- 
tion therapy would require a six-way study — 
demonstrating improvement over not just the 
individual constituents but also their various 
combinations — but combining the MABA 
with an inhaled corticosteroid would require 
a simpler burden of proof. 

The big question, says Cates, is whether the 
costs outweigh the benefits. For instance, add- 
ing corticosteroids may lower the risk of exac- 
erbations but increase the risk of pneumonia. 
Indeed, there have been fatal cases of pneumo- 


“The big device, as for PT003, 
question of drug may enhance their 
combinationsis — effect even further, 
whether the says Darrell Baker, a 
costs outweigh senior vice president 


the benefits.” 


who oversees GSK’s 


nia associated with Relvar (although it is not the 
only drug to carry that potential risk). Accord- 
ing to Baker, the company is likely to market 
the drug at lower doses than those given to trial 
subjects who developed pneumonia. 


respiratory portfolio. 

If this is the case, then another novel drug, 
being developed jointly by GSK and Thera- 
vance, could prove successful. The drug is a 
single molecule with both LAMA and LABA 
properties. Phase II clinical trials of this mus- 
carinic antagonist-B2 agonist (MABA) have 
been promising, says Baker, which when com- 
pared to salmeterol “improved bronchial dila- 
tion across a range of doses”. 

Even so, no trial data have been released so 
far, and the company’s plans for this MABA are 
unclear. “We are keen to see how it works as a 


Although many of these new drug can- 
didates hold promise and have short-term 
benefits, none of them has yet been shown to 
improve lung function year on year, says Cates. 
So until drug companies start to find ways to 
help repair damaged lungs and reverse the 
effects of COPD, there is only one long-term 
‘treatment’ available to patients, he adds, and 
it does not involve taking drugs but abstaining 
from them: quitting smoking. = 


Duncan Graham-Rowe is a freelance science 
writer based in Brighton, UK. 
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The Pearl River runs through Guangzhou in China, where high levels of smog increase the incidence of COPD. 


PUBLIC HEALTH 


Where there’s smoke 


Air pollution and smoking have made COPD a major problem in China, now compounded 
by outdated diagnostics and treatments — and experts say it’s bound to get worse. 


BY VIRGINIA HUGHES 


visitor to China may well notice the 
Av: smog problem as the plane 
descends. Smog levels in large cities 
such as Beijing and Shanghai frequently dwarf 
those of other metropolitan centres. Then 
there’s the cigarette smoke. China, the world’s 
most populous country, claims about one-third 
of the world’s smokers — at least 300 million 
people — who collectively puff 1.7 trillion ciga- 
rettes a year. In rural areas, cigarette smoke per- 
meates buses, shops and even doctors’ offices. 
Beyond cigarette smoke and outdoor air pol- 
lution, hundreds of millions of Chinese people 
breathe unclean air while working in factories 
and on industrial-scale farms or while cooking 
at wood-burning stoves inside their homes. 
These airborne toxicants — many of which 
are by-products of China’s economic boom 
— are risk factors for chronic obstructive pul- 
monary disease (COPD), an incurable respira- 
tory disorder that can cause severe breathing 


difficulties. And they have public-health 
officials worldwide worried about a coming 
epidemic. “We're just seeing the tip of the ice- 
berg on COPD in China,” says Don Sin, a res- 
piratory medicine specialist at the University of 
British Columbia in Vancouver, Canada, who 
researches COPD. “In 30 years, [the number of 
cases] is going to explode” 

A large-scale study in China put the preva- 
lence of COPD in 2004 at roughly 8% in people 
who are 40 or older’, in line with rates in the 
United Kingdom and the United States. But 
because China has seen soaring rates of indus- 
trialization and tobacco use over the past few 
decades, and because COPD symptoms aren't 
typically noticed until after age 50, public- 
health experts say the future is bleak. According 
to Sin, projections based 


on current trends and DNATURE.COM 
World Health Organiza- Nature China 
tion estimates show that _ covers the best of 
by 2030, COPD will kill3 Chinese research: 
million Chinese people a —_ go.iatuire.com/xuaofu 
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year — a million more than die annually now. 


SMOKE SIGNALS 

Since the late 1970s, disposable income has 
been on the rise in China and so has the Chinese 
tobacco industry, which has ramped up produc- 
tion and advertising. (However, recent laws have 
banned cigarette advertisements on television, the 
radio and in newspapers.) Cigarettes are cheap: as 
little as 30 US cents per pack, says Sin. 

Because ofa strong cultural stigma, only about 
one in fifty Chinese women smoke. But more 
than half ofall men, and about two-thirds of mid- 
dle-aged men, do. Smoking is popular among 
men in the city and the country, among those of 
all educational levels. And a survey of doctors in 
six Chinese cities, published in 2007, found that 
41% of male doctors smoke, and about 15% have 
smoked in front of their patients’. 

Smoking-cessation programmes exist, but 
their impact is minimal. Chunxue Bai, director 
of the Respiratory Research Institute at Fudan 
University, in Shanghai, says he has launched 
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smoking-cessation clinics at 58 hospitals across 
China. But the clinic at his own hospital sees 
only about 300 people a year. 

Despite the known link between smoking 
and COPD, most Chinese smokers do not fully 
appreciate the harm caused by smoking or are 
unconcerned. “They have some knowledge, 
but it doesn’t go deep enough to change their 
behaviour,’ says Frank Hu, a nutrition special- 
ist and epidemiologist at the Harvard School of 
Public Health in Boston, Massachusetts, who 
grew up in China. 

China's high smoking rates explain only part 
of its COPD problem though, because non- 
smokers are also at risk. For example, even 
though few women smoke, women still have 
high rates of COPD. 

One explanation is that women — 82.5%, 
according to one study’ — are exposed to 
second-hand cigarette smoke. Two large stud- 
ies in China have pointed to a link between 
this ‘passive smoking’ and COPD. The first of 
these studies, published in 2007, relied on the 
Guangzhou Biobank Cohort Study, a collec- 
tion of blood samples and extensive medical 
data from more than 20,000 people over age 50 
from Guangzhou, the largest city in southern 
China. In this group, the longer the duration of 
exposure to second-hand smoke, the higher the 
likelihood of COPD. The authors of this study 
also estimated that, of the 240 million people in 
China who were 50 years or older at the time of 
the study, 1.9 million non-smokers would die 
because of second-hand smoke’. 

Similarly, the second study, published in May 
2012 by Hu and his colleagues, followed 910 
non-smoking Chinese women for 17 years. 
They found that women who were exposed to 
second-hand smoke were 2.3 times more likely 
to die from COPD than those who were not 
exposed, with cumulative exposure increasing 
the risk’, 

But these data are controversial. Studies of 
second-hand smoke in Western countries have 
shown a suggestive but not definitive link to 
COPD. Three other studies in China also did 
not find an association between second-hand 
smoke and COPD. And COPD rates are up to 
three times higher for non-smoking Chinese 
women who live in rural areas than for non- 
smoking Chinese women who live in cities, 
even though their levels of second-hand smoke 
exposure are similar. 

“There’s no question that second-hand 
smoke is bad, but the [estimate] for how much 
COPD it causes in China is still not known,’ says 
David Christiani, an environmental geneticist at 
the Harvard School of Public Health. “Tt’s much 
more than just a simple linear connection?” 

Christiani became interested in the non- 
smoking causes of COPD in the late 1970s, 
when US health officials were pushing for 
regulations to limit workers’ exposure to cotton 
dust in textile factories. The process of turning 
fluffy cotton flowers into strong thread throws 
off particles of plant matter, pesticides and 


COPD IN CHINA: WHERE IT'S WORST 


In four of six provinces included in one study’, the 
disease was more common in the countryside, which 
has its own risk factors — such as home cook stoves 
that throw off toxins and poor healthcare. 
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bacterial endotoxin, and textile workers 
had high levels of respiratory disease. But, at 
the time, the US cotton industry suggested 
that workers’ lung problems stemmed from 
cigarette smoking rather than conditions at 
the mills. “Most workers were smokers, so 
it was very hard to tease these things apart,” 
Christiani says. 

Thanks to a serendipitous partnership with a 
university in Shanghai, Christiani realized that 
China offered a natural experiment to settle the 
debate. In Chinese factories, men and women 
work side by side; however, whereas most of 
the men smoke, hardly any of the women do. 

For the past 30 years, Christiani has fol- 
lowed hundreds of people working at two 
cotton mills and one silk mill in Shanghai. 
His research has shown that even in non-smok- 
ers, breathing cotton dust causes significant 
airway blockage. 

Although China has passed some labour 
laws to limit exposure to cotton dust, they’re 
often not enforced. And many other indus- 
tries pose similar threats or worse ones. Coal 
mining, welding, construction, grain process- 
ing and animal breeding produce their own 
airborne toxicants — including silica crystals, 
asbestos and faecal matter — all of which can 
contribute to COPD. 

The impact of these occupational hazards 
is evident even in the layout of Chinese hospi- 
tals, notes Peymané Adab, an epidemiologist at 
the University of Birmingham in the UK and 
an investigator on the Guangzhou Biobank 
Cohort Study. “There are specific wards for 


people who have been exposed to certain dusts 
or people with lung disease related to a particu- 
lar toxin,” she says. 

Another key factor is indoor air pollution. 
More than 70% of all Chinese households, and 
90% of rural ones, use stoves that run on wood, 
crop residues, coal or animal dung. Burning 
these fuels releases particulate matter, as well as 
fumes laced with carbon monoxide, formalde- 
hyde and free radicals. In 2007, China’s Minis- 
try of Environmental Protection set emissions 
standards for pollutants inside the home, but 
studies that measure air quality in rural homes 
have found levels exceeding the standards many 
times over. And exposure to biomass smoke 
increases COPD risk by 2.4 times, according 
toa 2010 meta-analysis of 15 studies®. “Women 
being the predominant cooks for families, they 
would be exposed to more of that,’ says Sin. 
That might partly explain why the COPD rates 
in non-smoking Chinese women are higher in 
rural areas than in cities — the dangerous types 
of stove are more common in the countryside. 

Sin points to a slew of other factors too. 
About 1.5 million people in China have tuber- 
culosis, and many areas of China have high 
rates of childhood pneumonia and malnutri- 
tion. All three factors increase the risk of COPD 
later in life. 


DEVICES WANTED 

As recently as a decade ago, COPD research in 
China was scant. Although awareness has grown 
considerably in the past few years, scientists still 
have trouble gauging the full extent of the disease. 

COPD prevalence ranges widely from region 
to region, from 5.5% in the province of Shanxi 
to 13.7% in the city of Chongqing, according 
to the large prevalence survey of 2004". Other 
studies have found even greater discrepancies 
(see ‘COPD in China: Where it’s worst’). 

One reason for this variability is inconsist- 
ency in the methods of diagnosis. Spirometry, 
in which the patient blows as hard as possible 
for as long as possible into a machine that meas- 
ures lung capacity, is the global gold standard. 
But spirometers are rarely used in China, espe- 
cially outside the big cities. Even though these 
machines are relatively inexpensive for special- 
ized medical equipment — several thousand 
dollars apiece — many doctors, especially in the 
countryside, do not use them. The large COPD 
prevalence study found that less than 7% of par- 
ticipants diagnosed with COPD had previously 
been tested by spirometry’. 

Instead, Chinese doctors tend to rely on sub- 
jective descriptions of symptoms to make the 
call, which can lead to misdiagnosis as asthma 
or anon-chronic form of bronchitis. “People 
go to see their doctor complaining of short- 
ness of breath, and responses would be, “You're 
getting old’ or “You have bronchitis, take some 
antibiotics, Sin says. 

Even for individuals who receive a bona fide 
COPD diagnosis, medical care and drug treat- 
ments can be lacking. In Western countries, 
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One study found that fewer than 7% of COPD patients in China had been tested by spirometry. 


common treatments for COPD include several 
kinds of inhalants, such as corticosteroids 
and B-agonists, that open up the airways in 
the lungs and provide immediate relief from 
symptoms. In the later stages of the disease, 
many patients use portable oxygen tanks. But 
these options are far too expensive for most 
Chinese patients. 

In China, the most common COPD treat- 
ments are two inexpensive tablets. One, 
called carbocisteine, helps break up mucus 
and phlegm in the respiratory tract and has 
antioxidant properties that may slow the pro- 
gression of COPD (see ‘A radical treatment’, 
page S4). The other, called theophylline, is a 
decades-old bronchodilator, or respiratory 
muscle relaxant, and anti-inflammatory. Theo- 
phylline is structurally similar to caffeine and 
is known for its erratic absorption and harsh 
side effects, including nausea, headache and 
life-threatening changes to heart rhythms. 

Chinese researchers are focused on rigor- 
ously testing the effectiveness of their small 
drug arsenal. Studies of Western patients have 
found little benefit to adding carbocisteine or 
theophylline to other, reportedly more effec- 
tive, treatments. But, in China, where most 
patients use nothing else, research has shown 
that the two drugs can help. For example, a 
2008 report found that in Chinese patients with 
COPD, carbocisteine treatment for one year 
reduced the number of exacerbations, or sus- 
tained disease flare-ups, better than a placebo’. 
The same was found for theophylline given in a 
slow-release formulation at doses low enough 
to prevent severe side effects’. 

Both drugs are much cheaper than inhal- 
ants. In 2008, the average annual cost of car- 
bocisteine therapy in China was 650 renminbi 
(about US$90 at the time) for each patient, 
compared with 4,320 renminbi ($580) for 
treatment with the standard combination of 


inhaled medications used in Western countries. 
Yet even the cheap drugs present an economic 
burden for most families, who shoulder the 
bulk of their healthcare costs without insurance 
or government assistance. One study found 
that the total cost for a patient with COPD in 
China was US$1,732 a year — a tremendous 
amount, given that the average urban Chinese 
household pulled in only $3,000 per capita in 
2010, according to an analysis of government 
statistics by the China Market Research Group. 

Beyond cost, the difference in quality of 
life is enormous. 
The inhalers used 


“Only about one. 

pith in Western coun- 
inf ifty aecinna tries slow down lung 
smoke, butmore deterioration and 
than half of all give patients years of 
men do. 


additional mobility. 
They are also better 
at preventing exacer- 
bations, which cause intense anxiety and often 
take weeks to recover from. These flare-ups 
often mean time off from work, making treat- 
ment costs even harder to bear. 


HAZY FUTURE 
For now, inhalers are available only to wealthy 
Chinese citizens. But, as China’s economy con- 
tinues to grow, so too will its middle class — 
and its market for more expensive drugs. 
Pharmaceutical giants have taken notice. 
A spokesperson for Novartis, headquartered 
in Basel, Switzerland, says the company 
recently approached a regulatory agency in 
China about one of its inhalers, a once-a-day 
bronchodilator. Jinping Zheng, deputy direc- 
tor of the Guangzhou Institute of Respiratory 
Disease, says that London-based AstraZeneca 
is investigating COPD in China. And he says 
that four years ago, GlaxoSmithKline (GSK), 
also headquartered in London, launched 
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COPD Academy, a programme that has 
reached more than 20,000 general practition- 
ers across China. A third of the programme 
focuses on a COPD drug called Seretide, a 
combined anti-inflammatory and bronchodi- 
lator that is marketed by GSK. The other two- 
thirds — developed with the help of the top 
18 respiratory experts in China, according to 
GSK — aims to deliver unbiased education on 
the basics of COPD diagnosis. 

There has also been some regulatory move- 
ment on the antismoking front. In the past 
few years, major cities in China have banned 
smoking in hospitals, restaurants, public trans- 
port and other indoor public places, and a 
similar nationwide ban was instituted in 2011. 
These laws are difficult to enforce, however, 
and many smokers blatantly ignore them. 

Smoking regulations promoted by promi- 
nent doctors and China’s Ministry of Health 
have a powerful enemy in the Chinese tobacco 
companies, which are controlled by the 
government at the local and national levels. 
“The tobacco companies are much stronger 
[than the health department] because they have 
the money,’ says Tai Hing Lam, director of the 
School of Public Health at the University of Hong 
Kong. “That's why the progress has not been as 
good as we would like” Lam and other experts 
say that China’s COPD problem will not be 
curbed until the public makes demands for more- 
sweeping political changes from its government. 

There is a precedent. It happened in the 
summer of 2008, when the Olympic Games 
were held in Beijing, and China shut down 
dozens of nearby factories and ordered half of 
all private cars off the road. The Beijing gov- 
ernment also passed smoking bans in schools, 
hospitals and government offices, and required 
non-smoking sections in hotels and restau- 
rants. Air pollution dropped quickly. 

But as soon as the festivities were over, 
business continued as usual, and many of the 
smoking bans were lifted. 

All China needs is a wake-up call to realize 
the significant human and economic burden of 
COPD, according to Christiani. “Once China 
starts recognizing the problem as something 
that needs to be tackled, it can make some 
significant strides.” But for millions of older 
Chinese people, any hope of avoiding COPD 
risks has gone up in smoke. m 


Virginia Hughes is a freelance science writer 
based in Brooklyn, New York. 
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